Gene GM21_0791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0791 
Symbol 
ID8136107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp940316 
End bp944485 
Gene Length4170 bp 
Protein Length1389 aa 
Translation table11 
GC content53% 
IMG OID644868409 
Producthypothetical protein 
Protein accessionYP_003020623 
Protein GI253699434 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones112 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATGCT ACAAAGACTA TTTCGGGATC AGGCCTGATT ATGCGCCGTG CATGACACTC 
GCGGACATAA ATAAGACACC GGAGACCTGG CTTGGATTCT ATCCGCACGA TTCGTTCGTG
GATATTTTGC GTTTGCTGCT CAAGAGTTTA GAGGGCGGAA ATAAGACGCT GTGGATCACG
GGAGCGTATG GCACTGGCAA GAGCCATGCG TCGCTGGTTC TGCAAAAACT GTTCACGGAT
GACGATGCAC GCGTCCAGAA ATGGCTCGCG CTACGCAAAG CGCAGATCCC GGAACCCGTA
CGTGAGGGGC TGCTGGCGCG GCGCGGGGAA AAAACGCTTG TCGTCTATGA TGTCAACGCC
GACGGTGTTG ACGCGAAGAA TCAGTTCCTA ATGCGCTTGC AGCGCGGTAT CACAAAAGCA
CTGGTCTCGG GGGGGCACAC GATTCCCCTG AAAGGCAAAC TCGATGAAGT TATTGAGCGC
ATCCGGCAGG ACGAGCCGTA TTTCTTTGCA AAACGCGACA CGATGCAGGC GCGGCTTTCG
CACCTGAACG CGGGTATCAA AACAGCCGAT TCGCTGGAGA AGAAGTTACG GGATGCAAAT
CAGGAAGCTG GGCTTGTCAG CGACGCAATG CGGGTTTTGG CCGCGCGACA CATTTTTTTG
GATTTGAACG CGGAAGATTT TCTTGCGTGG GTTGATGCAT CCTTGAAAGC GAACGGACTT
TCCAAGTTGG TATATATATG GGACGAGTTT TCGTCGTTTA TGGAACGCAA CCGTGCGGAA
TTAAAGACGC TTGAGCAGCT GGCAGAAGCC GCACAGTTGG GGCGGTTTTA TTTCGTACCG
GTGACCCACA CGGATATTTC TGCGTATGTG GCAGCAGGAT CCGAAAGCGC CAAAAAAGCG
AACGACCGAT TCATGTTCAA ACAGCTTGAC CTGCCGAACG AGACTGCGCT CAAACTGGCT
GCAGACGCCT TTGTCGTCAA GCCAGAGAAG GCCGTGGAGT GGACGCAAGA CCGGGACGAG
CTTTGGCGCA GTGTGAATCG CGTGGCCGAG AATTACATGG TAGTCAACAA CGCAGGCATC
GATGCTGCCG ATTTCAAGGG GATCCTGCCG CTGCACCCGA TGGCCGCATT CCTGCTGAAG
CACCTGTCGG TGGCGATTGG ATCAAACCAG CGAAGTATGT TCGAGTTCCT GAACGGAGAA
GAGTTCAGGG TATTCATTGA GAAGGGTGGA TTAGAGGTAG CCGGTCATCA GTTCCTGACG
GTGGACCATC TGTGGCGGTA TTTCGTGGAA CGGGAGGATC TTGGGACTGG GCAAGCAGTG
CAGGAAGTGC GAGCGGAGTA TGCAAGACGC GAAAAGGATC TGCAGCCTGA TGAGCAGCGC
GTATTCAAGG CGGTGTTGCT ATTCGGCTTG ATTGAACAGA TGCAGGGGAC AGGCCATCCG
CTGCTAAGCG TGACGAAGGA AAATATTCAG CGCAGCTTCG AGGGTGACGG TGCTTTGCAG
GGTGTGGATG CGATCTTAAG GGATTTGGAG CAGAAGCACT GTTTCACTAT TGTAGACGGC
CGCTGTGAGC GTTTCCGTGA TCGCTCAGAC TCGAAAGAAA TTGAGGAAAA AAAAGCTGCG
TTGGATGGTA AGTTTGCTGA ATATGTGTTG AAAGATACAG AGACGGAACT GGTTAAACAG
TTGAAAGGTG TCAACTATGG CGGTCGTTTT GATGTCCGTG CGGCGAGCGT CAACGGCCTT
TCAGCATCAA ACATAGCGAG ACGCGAGTCT TTTGGTGAGA CTGGGAACCG AGTGCTTGTT
CAGTTCATTT TGGCGCGTGA TGAGCAAGAA CAATTGCGCA TACCTGACAA GGCGAGAGAT
TTGGCTAAAC AGTTCAAAGA CCAGCGAATG CTGTTCGTGA CGCTGCCGGA AGTCAGTTTC
TGTCGTGATG ATAAAAAATC ATGGGAAAGC TTCAAGGAAA ATTTTGCCCG TTCTGCATTG
GCAGAAAAGT CCGGAGACAG CACCGGCAAG AAGGTGTTTG AGGCACAGGT CAGCAAGGCC
AAAGAGAAGT GGAATGAAAA AGTGACGAGT GCCGCTGCCA AACTGATCGT GTACAAACCC
AATCCAACCG GGGAGCCTTT CGTTGACGAA GTGACGTGGG GACAATTGAA AAAGGATTGG
CTCACTGACT ATGCGAAGCA GACATTCGAG GCGTATACGG ACGATCTGTC CGGCTTTAAC
ATCAAAGCAT TTGAAGTTCC GCCACGTGAC TTGCATAGGT GGGCTCTGGC GGGCATGGAG
TTTGATGGGT ATGCGCCCCC AGGCATGTGG AAGACCGTGG TGGCAACATG GCAGAAGAGC
GGCATTACCG GCGAGGATGC ATGGTTCGAT GCGAATCCGA ACCACCCGCT GACGCAGTTG
CGGGTCAAGT GCAAGGCATG GCAGGATAAC ACGGTGGGTG CTGGCAATAC CTGCTCCATA
CGCAAACTTT ACATCGATTT GCAACGTCCG CCTTTTGGAC TGATGGGAGT GCCGCATTCG
GCCTTCGTGT TGGGCTTTGT GCTGAAAACA TGGTTGACTG GACGGCGCAA CTTGCAATGG
TCGGACAACG TGACAAGCAA AGCCCTTGAC CGTTCAACGC TGGCGGAAAT AATCGATGCA
GTAGTGAAAG ACGATGGCGC CAACGCCATC AAAAACGAGA AGCTAATCTG TCTTCTGTCG
AAGGAGGAAA AGGCGTTCAT CGCACAGAGC AGCGTGATTT TCGAGACCAG CACACTTGTG
GACGGAACGG TGGAGGCAGC GCTGAACGCA GTGAGGACAC GGTTGGAACT GATTGCTGAG
CGCGTGCCGC TGTGGGTGCT GCCTGAATAT ATCTGTTCAC AAGCTGAGCC GAGCGCGGAA
GAGATGGGTA AAGTCATCGA TGCGCTATGC GCAGTGAACG GTATCAGTTC CAAAGGAGAC
ACCGAGACTC GCGGCACGAA GGTGAAGGAG ATCGGCGCTA TACTTCTGGC GACGCCGGGG
CTGGCGGAGG CTCTGGCCAA GTATATGAAA TCGACAGTGT TTGATGAGGC GTTCCAACGG
TATGTCGATA ACGCCAAGCC AGAGTTGAAA GCGGCGGCGG AACGCATGGG CGATTTGTCG
CGTAACTACT GCAAGGCGAT TAAAAACCGC TTTGCGGCGA CGAGCGGCTG GCTCTGGAAA
CGAGGCGATG CCGAGTCTGT TCTGGAAGAG GTTTACTGGC AAACACTGTG CGCCGAACAT
ATCCGCGGGC TGGCTGGTTC ATCAGGATAC ATGAACTTCG AGGATGCGCT GTGCCGCTTA
CGCAACGCGG TACTGAGTGA GAACAAGGTC CCGACTGAGT TCTGGGCTAA GAAGCATCCA
GCGTTGCAGC GCTTCTTTGA GTTGCTCAAC CGGCCGTTGT TGTCCGGCGA GGACGTGAAA
GCCTTTGGTG AGATCCTAGA ACAGCAAAAT GGGGTTATCT GCGAAGTGTT CTTTGACGTG
GCGCAAGCGC GGCAATTTGG AGCGATGCTG GAAATTTTCG GAGAAATTTG GCCTATGGCG
CCTGCCGAAA GTCGCGAGCT TTACAACACC TTCCCGTCAG GCATGGCACT AGCAGATGCT
CAGACTTTTA AGGCACAAGG ACGCGACAAG ATAGAGGAAT ACTGCCGGAC GCTGGTTTCC
AAGCTGGTAG CTACACTATG GCGCCAACAT ACAGACACGG AATCACCTGT CGAATGGAGC
CGCAAACACT CACTGCCCGC CGAATGTGTT TTGGCGGTAG ACGATGCCAA GGGCATCATT
GATGCGGTCG CGAATCCGGG AGGTGTATCG GCGGAACGTT TGCAGACCGT GCATAACGAA
CTTGAAAAAG AGGGCGCATT TGTAGATGTG GCCACGGCGG GTGAAAAGTT TCTCGAACGG
GTGCTGCCTG CTCGCTATCA AAAGTTCGGG TTCAGTGTCC GAGACCTGGG CGATTGGTTG
TGCAAAAATT TGAGTGACAC ACCGAGCCGA TGGCTGACGA ACGGAGGACT TCGCGAAGCC
GTCGAGGCGT TCGTGAAGCA GGGATATGAC AGTCATGCGC GGAAGCAGGC CGTTGAGAAA
GTGAACACGC TCTCCGATGC CGAGGCTAAG ATGCTGCTCC TTAAGATGAT TGATCAAATT
CCCGATGTAG GACTTTCGGT GCTGGAGTAA
 
Protein sequence
MACYKDYFGI RPDYAPCMTL ADINKTPETW LGFYPHDSFV DILRLLLKSL EGGNKTLWIT 
GAYGTGKSHA SLVLQKLFTD DDARVQKWLA LRKAQIPEPV REGLLARRGE KTLVVYDVNA
DGVDAKNQFL MRLQRGITKA LVSGGHTIPL KGKLDEVIER IRQDEPYFFA KRDTMQARLS
HLNAGIKTAD SLEKKLRDAN QEAGLVSDAM RVLAARHIFL DLNAEDFLAW VDASLKANGL
SKLVYIWDEF SSFMERNRAE LKTLEQLAEA AQLGRFYFVP VTHTDISAYV AAGSESAKKA
NDRFMFKQLD LPNETALKLA ADAFVVKPEK AVEWTQDRDE LWRSVNRVAE NYMVVNNAGI
DAADFKGILP LHPMAAFLLK HLSVAIGSNQ RSMFEFLNGE EFRVFIEKGG LEVAGHQFLT
VDHLWRYFVE REDLGTGQAV QEVRAEYARR EKDLQPDEQR VFKAVLLFGL IEQMQGTGHP
LLSVTKENIQ RSFEGDGALQ GVDAILRDLE QKHCFTIVDG RCERFRDRSD SKEIEEKKAA
LDGKFAEYVL KDTETELVKQ LKGVNYGGRF DVRAASVNGL SASNIARRES FGETGNRVLV
QFILARDEQE QLRIPDKARD LAKQFKDQRM LFVTLPEVSF CRDDKKSWES FKENFARSAL
AEKSGDSTGK KVFEAQVSKA KEKWNEKVTS AAAKLIVYKP NPTGEPFVDE VTWGQLKKDW
LTDYAKQTFE AYTDDLSGFN IKAFEVPPRD LHRWALAGME FDGYAPPGMW KTVVATWQKS
GITGEDAWFD ANPNHPLTQL RVKCKAWQDN TVGAGNTCSI RKLYIDLQRP PFGLMGVPHS
AFVLGFVLKT WLTGRRNLQW SDNVTSKALD RSTLAEIIDA VVKDDGANAI KNEKLICLLS
KEEKAFIAQS SVIFETSTLV DGTVEAALNA VRTRLELIAE RVPLWVLPEY ICSQAEPSAE
EMGKVIDALC AVNGISSKGD TETRGTKVKE IGAILLATPG LAEALAKYMK STVFDEAFQR
YVDNAKPELK AAAERMGDLS RNYCKAIKNR FAATSGWLWK RGDAESVLEE VYWQTLCAEH
IRGLAGSSGY MNFEDALCRL RNAVLSENKV PTEFWAKKHP ALQRFFELLN RPLLSGEDVK
AFGEILEQQN GVICEVFFDV AQARQFGAML EIFGEIWPMA PAESRELYNT FPSGMALADA
QTFKAQGRDK IEEYCRTLVS KLVATLWRQH TDTESPVEWS RKHSLPAECV LAVDDAKGII
DAVANPGGVS AERLQTVHNE LEKEGAFVDV ATAGEKFLER VLPARYQKFG FSVRDLGDWL
CKNLSDTPSR WLTNGGLREA VEAFVKQGYD SHARKQAVEK VNTLSDAEAK MLLLKMIDQI
PDVGLSVLE