Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3996 |
Symbol | |
ID | 4694094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 4384372 |
End bp | 4387311 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639851745 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_998721 |
Protein GI | 121610914 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.290267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0213209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCCGTT TCCTGTCCCG CGAGCCTGTG CACGATCCGC TGTCGGTGGC CGCGGCCCAT GCCGCAACCG AGGTCAAGAC CACCACCTGC TACATGTGCG CCTGCCGCTG CGGCATCCGC GTGCACCTGC GCGCAAGCGA CGAAGGCCCG CAGTTGCGCT ACATCGACGG CAACCCCAGG CATCCGCTGA ACCAGGGCGT GATCTGCGCC AAGGGCGCGT CGGGCATCAT GAAGCAGTTC TCGCCCGCGC GCATCACGCA GCCGCTGCTG CGCAAGGCCG GCAGCGAGCG CGGGGCCGGG GAGTTCGAGC CCATCAGTTG GGAGCGCGCC TACGACATGC TGACCGAGCG TCTGGCCCGC ATCCGCGCGA GCGACCCGAA GAAGTTCGCG CTGTTTACCG GCCGCGACCA GATGCAGGCG CTCACCGGCC TGTTCGCGCG CCAGTTCGGC ACGCCCAACT ATGCTGCGCA TGGCGGCCTG TGCTCGGTCA ACATGGCGGC GGGAATGATC TACACCATCG GCGGCAGTTT CTGGGAATTC GGCGGCCCCG ATCTGGAGCG CGCCAAGCTG TTCGTGATGA TCGGCACCGC CGAAGACCAC CACAGCAATC CGATGAAGAT TGCGCTTGGC AAGTTCAAGC GCGCCGGCGG GCGCTTCATC GCGATCAACC CGGTGCGCAC CGGCTATGCG GCCATTGCCG ACGAGTGGAT TGCAATCAAG CCGGGCACCG ACGGCGCGCT GTTCATGGCG CTGCTGCACG AGTTGATCGC CGGCGAACTG ATCGACCATG CATTCCTGCA GCGCTTTACC AATGCGCCGC AATTGGTGCT GCTCGATGAC GGCGAGCGCC AAGGGCTGTT CGCCTTCGAT CCCGAGCGCG GCCCGCCGGG CGATGGCCGC AATCCGCACA ACAAACTGGT CTGGGACAAG CGCAGTCGCA GGGTGCTGCC GGCCTACCCC GAAGGCATTG CCGAGGGCTG CGATCCGGCC CTGGAAGGCC ATTACCGGCT GGCCGATGGC ACGCGCGTCG CGCCCTCGTT CCAGTTGCTG CGCGAGCGTG TGGCCGACTG CACGCCCGAG TGGGCGCAGG CCATCACCGG CATCGACGCT GCGCGCATCC GCCAGTTGGC GCGCGAACTG GGCGAGACGG CGCTGCGGCA GGCTTTCGAT CTGCCCATTG CCTGGACCGA TGCCTGGGGC AAGCAATACC CGGGCACGCG GGCGCGGCCG GTGGCCTTGC ATGCGATGCG CGGCTTGGCG GCGCATTCCA ATGGTTTTCA GACCGTGCGG GCGCTGGCGC TGCTGATGAG CGTGCTCGGC ACCATCGACG CGCCCGGGGG CTTTCGGCAC AAGGCGCCGT ACCCGCGGCA TATCGTGCCC AACTACCGCG CCTTCAACAC GCCGGCGATG ATGCAGCCGG ACACGCCGCT GAACGCGGCG CCGCTGGGTT TTCCGGCCAG CCCGCAAGAA CTGGCGGTCA ACCCGGACGG CTCGCCCATC CGCATCGACC ATGCTTTCTC GTGGGAGCAC CCGCTGTCGG TGCACGGGCT GATGCACAAC GTGATCACCA ACGCGGTCAG GGGCGACCCG TACCGCATCG ACACGCTGCT GATCTTCATG GCCAACATGG CCTGGAACTC CAGCATGAAC ACCATGGGCG TGCGCGAGAT GCTCAACCGC AAGGACGACA AGGGCGAGCA CATGATTCCC TTTCTGGTGG TGTGCGATGC CTTCCAGAGC GAGACCGTGG CCTTTGCCGA CCTGGTGCTG CCCGATACCA CCTACCTCGA ACGCCATGAC GTGATGAGCA TGCTCGACCG GCCGATCTCG GAATTCGACG GCCCGGTCGA TTCGGTGCGC GTGCCGGTGC TGCCGCCGAC GGGGCAGTGC CGGCCCTTCC AGGAGGTGCT GATCGAGTTG GCCTCGCGGC TGAAGTTTCC GGCTTTCACC ACGCCCGAGG GCGGGCGCAA GTTCAGCAGC TACCCGGACT TCATCGTCAA CTACGAGCCG CAGCCGGGCA TCGGCTTCCT GATGGGCTGG CGCGGCAAGG ACGGCAGCGA GCATCTGCGC GGCGCGCCCA ACCCGCAGCA GTGGCAGGCC TATGCGCAAA ACGACTGCGT GTTCCGCCAC CACATGCCCG AGAGCATGCA CTACATGCGC AACTGGAACC GCGAGTACCT GGACTTTGCC AAAGACAAGG GCTGGCGCCA GCGCAACGAC CCGGTGCAAC TGGCGCTGTA TTCCGACACG CTGCAGAGCT TCCGGCTGGC CGCGCAAGGC AAGAGCACCG GGCGGCAGCC GCCCGAGGCG CTGCGCGAGC GCATCGCCAC CTACTTCGAT CCGCTGCCCT TTTGGTATCC GCCGCTCGAA GATGCAAGCA CCGACCTGGC CGCCTACCCG CTCAACGCGA TCACGCAGCG GCCGATGGCG ATGTACCACT CGTGGGACTC GCAGAACGCC TGGCTGCGGC AGATCCATAG CCACAACTAC CTGCATGTGA ACCCGCTGAG CGCGCAGGCG GCCGGCATTG CCGACGGCCA CTGGTGCTGG GTCGAGAGCC GCTGGGGCCG GGTGCGCTGC ATGTTGCGCT ACAGCGAGGC CGTGGAGCCG GGCACGGTGT GGACCTGGAA CGCGATCGGC AAGGCCGATG GCGCCTGGCG GCTTGCGCCC GGTTCGGACG AGGCGCGCAA GGGCTTTTTG CTGAACCACC TGATCAGCGA GGAACTGCCG TTTGCCGGCA GCGCCAGCGC GCACATCAGC AATTCTGACC CGATCACCGG GCAGGCCGGC TGGTTCGACG TGCGCGTGCG CATACGGCCG GTGGAACCCG GCGAGCCTGG GCAGAGCTTT CCGCAGATCG CCAGCATGCC GGTCGTGCCC GGCGTGCTCG GCCAGGCAGC CCATGTGCTG CGCTACTTTG CCGGGCGGGG CCGGCAATGA
|
Protein sequence | MFRFLSREPV HDPLSVAAAH AATEVKTTTC YMCACRCGIR VHLRASDEGP QLRYIDGNPR HPLNQGVICA KGASGIMKQF SPARITQPLL RKAGSERGAG EFEPISWERA YDMLTERLAR IRASDPKKFA LFTGRDQMQA LTGLFARQFG TPNYAAHGGL CSVNMAAGMI YTIGGSFWEF GGPDLERAKL FVMIGTAEDH HSNPMKIALG KFKRAGGRFI AINPVRTGYA AIADEWIAIK PGTDGALFMA LLHELIAGEL IDHAFLQRFT NAPQLVLLDD GERQGLFAFD PERGPPGDGR NPHNKLVWDK RSRRVLPAYP EGIAEGCDPA LEGHYRLADG TRVAPSFQLL RERVADCTPE WAQAITGIDA ARIRQLAREL GETALRQAFD LPIAWTDAWG KQYPGTRARP VALHAMRGLA AHSNGFQTVR ALALLMSVLG TIDAPGGFRH KAPYPRHIVP NYRAFNTPAM MQPDTPLNAA PLGFPASPQE LAVNPDGSPI RIDHAFSWEH PLSVHGLMHN VITNAVRGDP YRIDTLLIFM ANMAWNSSMN TMGVREMLNR KDDKGEHMIP FLVVCDAFQS ETVAFADLVL PDTTYLERHD VMSMLDRPIS EFDGPVDSVR VPVLPPTGQC RPFQEVLIEL ASRLKFPAFT TPEGGRKFSS YPDFIVNYEP QPGIGFLMGW RGKDGSEHLR GAPNPQQWQA YAQNDCVFRH HMPESMHYMR NWNREYLDFA KDKGWRQRND PVQLALYSDT LQSFRLAAQG KSTGRQPPEA LRERIATYFD PLPFWYPPLE DASTDLAAYP LNAITQRPMA MYHSWDSQNA WLRQIHSHNY LHVNPLSAQA AGIADGHWCW VESRWGRVRC MLRYSEAVEP GTVWTWNAIG KADGAWRLAP GSDEARKGFL LNHLISEELP FAGSASAHIS NSDPITGQAG WFDVRVRIRP VEPGEPGQSF PQIASMPVVP GVLGQAAHVL RYFAGRGRQ
|
| |