Gene GM21_1677 details


Gene Information

Locus tag: GM21_1677
Symbol:
ID: 8137008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1953865
End bp: 1957260
Gene Length: 3396 bp
Protein Length: 1131 aa
Translation table: 11
GC content: 64%
IMG OID: 644869289
Product: hypothetical protein
Protein accession: YP_003021489
Protein GI: 253700300
COG category:
COG ID:
TIGRFAM ID:
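The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS includes one terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1953865
end_bp = 1957260

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3396 1131
```

Under these assumptions the computed values agree with the listed Gene Length (3396 bp) and Protein Length (1131 aa).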


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 129
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAC CTTCCCCTGT CATAGCGAAC CTGCCGCAGC TTCCCCCCGA GGAGGATTTC 
CACCGCCTGC GCCGCGAGGG GATCGGCTTC ATCGTGCAGA TGGGAAGCCG CCTCTGGACC
GAGTACAACG AGACCGACTC CGGAATCGCC ATACTGGAGG CGCTCTGTTA CGCCGTAACC
GACCTAGGCT ACCGCAGCGG CTGGGAGATC CGGGACCTCC TGGCGCCTCC CCTCCCCTCC
CCCGACCCGG CGCACCCCTT CCCCAACCAG CCCTTCTTCA CCGCTCGGGA GATCCTCACG
GTCAACCCGT GGACCCCCGA CGACTTCAGG CGGCTTTTGA TCGACCAGGA GGCGGTGCGA
AACGCCTGGA TCGCCTGCAA GGAGTGCGCG TGCGATACCA GCTACTACGC CTGGTGCGAG
GGAGAGAGGC TCGCCCTTTC CTACCAGTTG CCGGAAAACC GAAGGCTCAG GCCCAAGGAG
GTTTGGCCGC TGGGGCTGTA CGAGGCGCTC TTGGAATTGG AGGCGGACGC GGACCTTGGA
GACCTGAACG ACCGCAAGGT CGAAGCCGCC GTCACGCTGG AGGACGCAAA CGGCAAGCAC
CCGCTCACGA CGGAGCTCCG CTTTCACGAC ATGGCCCTCT CGGACCGGGT CGGCTGGGGA
CTTTTTCTGG GGAGCGACGA CGCATTCGCC GGACGCAACG GCCAATCCTT CAACCTGAAG
CTGATCGGCT TCGGTGCTAC GCGCAATTAC GACCTCTTGA CGGACCCGAA CCTGGACGAC
GCCGGGCGCA ACGACTACCT GCGCCGGCAC TGGCGCGACC TCTTCTACCT CGCCCTGGAA
ATCGAGATGG TACCGACCGG CAAGAAGATC GTGCTGCATG CGACGCTGAG GTTCTTGGGG
GACGCTGCGG CGAGGGGTGC CGCGACGGTG GCGGCGCTTA AGGGGATCTT GGAGGAGACG
GGCGTGAACG GGCTCACGCA GCGCTACCGC AAAAAAGAGC TGCAAAAGGC GGCCGGTGTG
GCGCGGGCGA AGGAATCGCT TTTCTCGCAC CGGAACCTCG ACGAGGAGTT CTGCCGGGTG
AAGCTGATCG GGATCGAGGA GGTGGCCGCC TGCGCCGACG TGGAGGTATC CCCCGAGGTC
GACATCGAGC TGGTGCAGGC GCGGATCTGG TTCGAGATCG AGCAGTACCT GAACCAGGCG
GTCCCGTTCT ACACGCTCCG TGAGATGCTG GAGCAGGGTT TTCCGGTGGA GGAGATCTTC
AACGGGCCGG CGCTTAAGAG CGGCTTCATC AGGACGACCG ACCTGGAGCA GGCGACGCTG
AGATCCGTCC TCTGCGTCTC CGACCTCCTT AACCGGCTGA TGGAGATAGA CGGCGTCCTA
GCGGTGAACC ACCTGCAGCT CACCAAGTAC GACCCGGAAG GAAAGGCGGT CAAGGGTGCC
GCCGACCCGG CCTGGACGAG CGACGGCAAA CCGATCTTCG ACCCCGGCAA GATCAGCGCC
TCCTGGCTTT TGTACCTGAG CCCCCAGCAC CTGCCCAGGC TTTACCGCAA CGCCTCGCGC
TTTCTCTTCT ATAAGAACGG GCTCCCCTTT CTCCCCCGGA TGGATGAGGC CCTGGCGACA
CTTACCCAGT TGCGCGGTGA GGCCGAGCGG ATGCGGGTGA AGAACGCCCC GAACGATCTC
CCCATTCCAG CGGGGAACTA CCGGGACCCG GCGGCGTACT TCCCGGTCCA GTACAGCTTT
CCCCTCACCT ACGGCATCGG GGTGGACCAA CTTCCTGCCA ACGCGAACGC GAAGCGAAGG
GCGCAGGCGA AGCAGTTTAA GGGCTACCTC ATGGTGTTCG AGCAGCTCCT CGCCGACGCC
CTGGAGCAGC TCGCGCACAC GGCCGACCTC TTCTCGCTGG ACCCGCTGGT GAAGCGGACC
TACTTCGCCG CACACCTGAG CGAGGCGTTG ATCCAGGGGT ACAACGAGCT CTCCACCATC
ACCCAGGCGA CCCTTGAAGC GCTGCTCGAA AAGGAGCCGG AATTCCTCAA GCGGCGCAAC
AGGTTTCTCG ATCACGTGCT GGCGCGGTTC GGCGGGGAAT ACAGGGAATT CACCCTGCTC
CTGGAGAAGC TGCAGGGACA GCAGGTAGCG CTTGGAGCGC TTATCGGCGA CAAGATCGAC
TTCATCACCG CCTACCCGGT CGTAAGCCGC GACCGGGCCA AGGCTTTCAA CAGGGAGCTT
GCCTGCGCGC CGGGGAACGA CCCCGCGATC AAGCGGCGCA TCGCGCTGCT TCTTGGGAAA
AAGGAGTTGA GCGACCGGAT CATCGTGGTC GAGCACCTGC TTTTGCGCCC GAAATTCCCC
GGGGACGCGC TCTACCCCGC ATGCAGCGAT GGCGCCTGCC GGCTATGCGG AGAAGAGGAC
CCCTATTCAT TCAGGCTCAC CCTGGCGATG CCGGGGTGGA TGGAACCGTT TGACTCGGAC
CTGGTGATGA GGGAGTACGC CGACCGGGTC ATAAGACAGG AGCTGCCGTC GCACCTGGTG
GGGAAGATCT GCTGGGTCGG CAACGACGGC TTCGAGGAGG ACCCGTGCGA CCCCGTCATA
TTGGAGCTGA CGCGATTAAT CGAGGAGAAG GGGAACGGCA TAGCCGGGGT CCGCCCCACC
GAGGACGAGG CCTGCGCCTG CGCCCTAGGG GCGTACCACG AATTCTCCCA GGTATTCCGC
GAGTGGTACC AGGACAAGGT GCTGCGGCAT ATCCACCCGG ACGCCCTGAA GCAGCAACTG
GAGCTGTTGT TCGGCCAGAA AGTGGATCGC GCCACCATAC CCTGCGCCGC CGTCTGGGAC
GACGAGCTGT GGGCGGAGGT GACGAAGCTT TTGGTCGGGC GTTTCCTGGA GATCGCCCTT
TACGGCTTCC AGTTCGACCG CTTCCAGGCG GCCTGGTGCG CCTGGCTGGA AGCCGACGCC
GCATTCGACT GGACGGAAGA GCGCCTGCAG GAACGGCTGC AGGCTATCCT TACCGAGAAT
CTCCTCTCCA GTTCCGCCGA CCTCGGATCC CCGGCCGGCC GGATCTGCCG CTGCGCCGAA
CGGATCCTGC GCAGCTACGG CGCCACGTTC GACCTCTGGA TGCAGGGTCT CGTAGCTTCG
GACAGCTTCG ATCCCGACGC CCCTTTGCCC CCCTTCCCGC TCGATCCACC GCCGGAGTGC
GCCGGACTCG GTTTCAAGGC GGGTACCATG GCGCGGTTGA AGGAGCTCGT CGAAGACAGG
TACGGCGCCT ACAGGAATGT CTCCTACCGG CTCCGGGTCG TGCTGGACCT CCTGGGGAGG
CTGCGGAACG TTTACCCTCC GGCGACCCTG CACGACTGCG ACGAAGGCGG CGACAAAAAC
CCGGTGCGGC TGGGGACAAC AGCTTTAGGA AACTGA
 
Protein sequence
MSQPSPVIAN LPQLPPEEDF HRLRREGIGF IVQMGSRLWT EYNETDSGIA ILEALCYAVT 
DLGYRSGWEI RDLLAPPLPS PDPAHPFPNQ PFFTAREILT VNPWTPDDFR RLLIDQEAVR
NAWIACKECA CDTSYYAWCE GERLALSYQL PENRRLRPKE VWPLGLYEAL LELEADADLG
DLNDRKVEAA VTLEDANGKH PLTTELRFHD MALSDRVGWG LFLGSDDAFA GRNGQSFNLK
LIGFGATRNY DLLTDPNLDD AGRNDYLRRH WRDLFYLALE IEMVPTGKKI VLHATLRFLG
DAAARGAATV AALKGILEET GVNGLTQRYR KKELQKAAGV ARAKESLFSH RNLDEEFCRV
KLIGIEEVAA CADVEVSPEV DIELVQARIW FEIEQYLNQA VPFYTLREML EQGFPVEEIF
NGPALKSGFI RTTDLEQATL RSVLCVSDLL NRLMEIDGVL AVNHLQLTKY DPEGKAVKGA
ADPAWTSDGK PIFDPGKISA SWLLYLSPQH LPRLYRNASR FLFYKNGLPF LPRMDEALAT
LTQLRGEAER MRVKNAPNDL PIPAGNYRDP AAYFPVQYSF PLTYGIGVDQ LPANANAKRR
AQAKQFKGYL MVFEQLLADA LEQLAHTADL FSLDPLVKRT YFAAHLSEAL IQGYNELSTI
TQATLEALLE KEPEFLKRRN RFLDHVLARF GGEYREFTLL LEKLQGQQVA LGALIGDKID
FITAYPVVSR DRAKAFNREL ACAPGNDPAI KRRIALLLGK KELSDRIIVV EHLLLRPKFP
GDALYPACSD GACRLCGEED PYSFRLTLAM PGWMEPFDSD LVMREYADRV IRQELPSHLV
GKICWVGNDG FEEDPCDPVI LELTRLIEEK GNGIAGVRPT EDEACACALG AYHEFSQVFR
EWYQDKVLRH IHPDALKQQL ELLFGQKVDR ATIPCAAVWD DELWAEVTKL LVGRFLEIAL
YGFQFDRFQA AWCAWLEADA AFDWTEERLQ ERLQAILTEN LLSSSADLGS PAGRICRCAE
RILRSYGATF DLWMQGLVAS DSFDPDAPLP PFPLDPPPEC AGLGFKAGTM ARLKELVEDR
YGAYRNVSYR LRVVLDLLGR LRNVYPPATL HDCDEGGDKN PVRLGTTALG N
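The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the tool that generated this record: it uses the standard codon assignments, which translation table 11 shares for sense codons, and it does not handle alternative start codons (this gene begins with ATG, so that is not needed here). The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same sense-codon assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (whitespace allowed) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCCAAC CTTCCCCTGT CATAGCGAAC"))  # MSQPSPVIAN
```

The output matches the first ten residues of the protein sequence listed above; passing the full gene sequence to the same function would be the way to check the remaining residues.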