Gene GM21_1678 details

Gene Information

Locus tag: GM21_1678
Symbol:
ID: 8137009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1957257
End bp: 1960964
Gene Length: 3708 bp
Protein Length: 1235 aa
Translation table: 11
GC content: 65%
IMG OID: 644869290
Product: hypothetical protein
Protein accession: YP_003021490
Protein GI: 253700301
COG category:
COG ID:
TIGRFAM ID: [TIGR02243] conserved hypothetical protein, phage tail-like region
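
The coordinates and lengths listed in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1957257
end_bp = 1960964
gene_length_bp = 3708
protein_length_aa = 1235

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode an amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length, and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(record_is_consistent())  # True
```

Under these assumptions the fields agree: the coordinate span is 3708 bp, which is 1236 codons, giving 1235 amino acids once the stop codon is excluded.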


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 121
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGAA GCGACTGCAA ACAAAACCGC GATCCGCTGC AACCGGTACG GGAGGGAACC 
AGCCGGGGAG AGCGGCTCTC CCCCGCGCTG GACCCGGGCT ACGCGCCGGT GGACGAGCGG
ACCGCGGCAC ACCACATGGT GTTCGCGCAG GCGTACTCGG CCTTTCTCAA CTACTTCAAC
GCCGACAACG CGCTCGACGG TGACTGGCAA CCTTTCTTCA GCGGCGACGT TTCGGTGCAA
CTGGCGCTGG CGGCGGTGCA GGACGTCGAC GCCTACAGAA CCCGCATCAA GGAGTACCTC
GACTTCCTGA AGGATCTTGA CTTCCAGTCG GACGAGGCGA AGGCCAAAGA GCACCTGGGC
TACCTGTTCA GCGCCGCCGC GACGCTCGCG CGCCAGTTCG ACCTCTTTAG AAGCGGGCTT
CCAGCCGACC TGCCGCTGAA ATCCACCCTG CAGGCCCTGA TCCAGACCCA GCTCGCTCCC
GCCTTCAACC GGCTCACCGC CTATTTCAAG GCCGACCAGG CCCTCCCCCA GCCCGACCGC
CTCATCGCGG AAGCGGCCCC CGACCTGGTG GTTTTGGGGA AGAGCGCAGT CGCGTTCGCT
GAGGTTTACC AGGGTGGGCT GTCGGGGGAA TGGATCACGG ACGCCTCGGC GGACTGGGCC
GCTTACACGA ACGCCGTACC CCCCGATGCG TCCGTGTACG GTTCGGGGAT TGAGCCTTTC
CAGCGCATCA ACCACATAGC CACCCACAAC CTCTTCACCT CGGCCCTGGA CCAGTTTCTC
AAGGCGTACG CCCGGACCGC GGCGGAAGCC GCCAAGGGAC TCGAAGCCTC GCTCACCGAC
CGGGACGACC ACCACCCCCA TTACGCTCTC TTCCTGACCT TTCTGAGGCT CTTCGCCTAC
GTGCGCGACG CCGCCAACAC CCTGACCGGC CGCCATCTCG ACTTCTATTA CCGCGACATC
CTGCAACTCA AGGAAAAGGA GGCCAGGCCC AGCCACGCGC ACCTCCTTAT CGAACTCGCC
AAGCAGGCAC CCGAGCACCT GGTGCGGGCC GGCGAACTCT TCAAGGCCGG AAAGGACGCG
CTCGGGCACG ACGCCTTTTT CGCCAACGAC CGCGACTTCG TTGCGAACCA GGCGCAGGTG
GCCGCACTAA AGACGGTGTA CCGGCACGGC ACGGAGAAGG TTGGGGTCAC CGCCCCCAGT
CCCCTGCAGC AGGGACGGAT CTTCGCCTCG CCGGTCGCCA ACTCGGAAGA CGGCATCGGC
GGCGAACTGA CCTCCGAGGA CAAGTCGTGG CACCCCTTCC ACAACAAGAA ATACCAGGAC
GGGACCCTGG CAAAGATAGA CATGCCTAAA GTGGAGATCG GGTTCGGCAT CGCGTCACAC
TACCTCTACC TTGCCCAGGG GGAAAGGAGC GTGACGCTCA GTTTCCAGAG TCAGCCGGGA
GTGTCCCGGG ACTACAAGAA CGACGTTGCC TGCCTGTTCA GCTCGGAAAA GGGTTGGCTG
GAAAAGGCGC CGCTCTCCTT CGCCCAGGAG ACCGGGGGGC TGGCGCTCCG GGTCGCCCTC
ACCGGGGCGG ACCCAGCCGT CGTCGCCTAT TCCGCAAAGC TCCACGGCTA CAGCTTCGCC
ACCGAACTTC CGGTGCTGCT GATACGGCTC AAGCACCGCG ACGACGCCCC CTTCGCCTAC
CCGGAGTTGG AGCCGCTCAC GCTGAACCGG GTGAAACTCG CCGTAGAGGT GACGGGGCTG
AAGACAGCGG CCCTGTCCAA CGACTTCGGT CCGGTGGACG CCTCCAAGCC CTTTCAGCCT
TTCGGCTCTT CCCCGGCAGC CAACAGCTCC TTCGTCATCG GCTCAGGCGA GATGTTCCAG
AAGTCGCTCA CGAAGGCAGC GGTGAACGTC CAGTGGCAAA GCGCCCCCGC CCCCTTCAAA
GGCAAGAGCG TGACTGCGAT GACTGAATAC CTGCAAGGTG GGGTCTGGCG CCAGTTCAAG
ACGAGCCTTG AGGGAGACGT CACCAGCACC ACCTTCGACC TCTTGGGCGA ACCGCAGTCG
GCGGACGCCC CCCCCTATTC CGATTCCCCC GCCGCGCCGG AAAACGAGCA GTACCAGACC
TCCTCGCGAA ACGGCTTCGT CCGCCTGAGG CTCGCCGCCG ACTTCGGCCA AAGCGACTAC
GAGCAGGCGC TCATCGATTA CATCAAGCGC ATCACCGACG GCGACCCCAA TCCGGGGTCC
AAGCCGATCG CCCCGACAGG CCCATACGTC ACCGGCCTGA CGCTTGACTA CGGCGCGCAA
CAGACCATAA CCCTGGACCA GGCGGACCTG GACAAGTTCA AGGCGAGGCA GGCCCTCTTT
TTCCACCTGA ACCCCTTCGG TTACACGGAG CAGCACCCAT ATTTGAAGAC GACGGTGCCG
GCTGCTCACC CCTCCATGAA GCTGGTGCCG CAGTTCAAGC ACGCAAACCG CGAGGACCCG
ACTCTTCCCA AGGGGGTGCC GGTGCCGCAC GAGGCCGAGT TCTACATAGG GGTCGCAGGC
CTCGCTCCGC CGCAGAACCT GGCGCTATTG TTCCAGGTGG TCGACGGGAG CGCCGACCCG
CTCTCCATCA AGCCGAAGCC GCATATCGAC TGGAGCTACC TGCAGCAAAA CGAGTGGGTC
CCCTTCGGAC ACGGCGAGGT TGAAGACCAA ACGGGCGAAC TGATCGACTC GGGCATCGTC
ACCCTCTCCA TCCCCCGCGG CGCCTCCAGC GACAACACGC TGCTCCCGGC AGAGCTGCAT
TGGGTGCGCG CGGCGGTGAA AAGCAACAGC GATTCGGTCT GCCGCCTGAT CCTCGTCGCC
GCCCAGGCGC TGCAGGCGAC GTTTGAGGAC CAGGGAAACG ACCCCGCTTT CGTCGGGGGC
ATGCTCCCCG CAGGGAGCAT CGCGAAATTG GCGCAGCCCG ACGCCGACGT GAAGAAGGCG
AGCCAGCCCT TCGCCGGCTT CGGGGGGCGG GGGAAGGAGG CGCCCGCCGA CTTCTACACC
AGGGTCTCGG AGCGGCTGCG CCACAAGGAC CGCGCCATCG CGCAGTGGGA CTACGAGCGG
CTGGTGCTGG AAGGGTTCCC CCAGATCTAC CGGGTCAAGT GCCTGAACCA CACGCAGTAC
GAGCCGGACG CGACCGGGAG CGGGATCTAC CGGGAACTCG CCCCCGGGCA CGTCACCGTG
GTCACCATCC CGGATCTGCG CTTCGCAGCC CTTAGGGACC CGCTGCGCCC GTACACGAGC
CTTGGGCTCC TGGACAAGAT CTCCGCATAC CTTGCGCAGC GGCTTTCCTG CTTCGTGCGG
CTGCACGTAA GGAACCCGCT CTTCGAGGAG GTGCAGGTCG ACTGCAAGGT AACCCTGCAA
CCGGGGCTGG ACCAGAGCTT TTACGAACTG AAGGTCAAGG AGGCGATCAC CCGTTTCCTT
TCCCCCTGGG CCTTCCCCGG CGGGGGGAAT CCATCCTTCG GCGGCAAGGT GAGGAAGTCT
GTCCTCATCG ACTTCGTGGA GGAACTCCCT TACGTGGAAT GCGTGATGGA CGTGAGGCTC
CTGCACAGCT ACATCGACGC CTATGGCTTC GCCCGCACCG ACGAAGTCGA CGAGGCGGCC
GGTTCCACCG CCGTCTCCAT CCTGGTCTCG GCCCGCAAAC ACCTGGTCGC GACCATTGAC
CCAGCCGAGG AGGAGACTCC CGGTGAACTC TGCCGGTGCC CCGCATGA
 
Protein sequence
MSRSDCKQNR DPLQPVREGT SRGERLSPAL DPGYAPVDER TAAHHMVFAQ AYSAFLNYFN 
ADNALDGDWQ PFFSGDVSVQ LALAAVQDVD AYRTRIKEYL DFLKDLDFQS DEAKAKEHLG
YLFSAAATLA RQFDLFRSGL PADLPLKSTL QALIQTQLAP AFNRLTAYFK ADQALPQPDR
LIAEAAPDLV VLGKSAVAFA EVYQGGLSGE WITDASADWA AYTNAVPPDA SVYGSGIEPF
QRINHIATHN LFTSALDQFL KAYARTAAEA AKGLEASLTD RDDHHPHYAL FLTFLRLFAY
VRDAANTLTG RHLDFYYRDI LQLKEKEARP SHAHLLIELA KQAPEHLVRA GELFKAGKDA
LGHDAFFAND RDFVANQAQV AALKTVYRHG TEKVGVTAPS PLQQGRIFAS PVANSEDGIG
GELTSEDKSW HPFHNKKYQD GTLAKIDMPK VEIGFGIASH YLYLAQGERS VTLSFQSQPG
VSRDYKNDVA CLFSSEKGWL EKAPLSFAQE TGGLALRVAL TGADPAVVAY SAKLHGYSFA
TELPVLLIRL KHRDDAPFAY PELEPLTLNR VKLAVEVTGL KTAALSNDFG PVDASKPFQP
FGSSPAANSS FVIGSGEMFQ KSLTKAAVNV QWQSAPAPFK GKSVTAMTEY LQGGVWRQFK
TSLEGDVTST TFDLLGEPQS ADAPPYSDSP AAPENEQYQT SSRNGFVRLR LAADFGQSDY
EQALIDYIKR ITDGDPNPGS KPIAPTGPYV TGLTLDYGAQ QTITLDQADL DKFKARQALF
FHLNPFGYTE QHPYLKTTVP AAHPSMKLVP QFKHANREDP TLPKGVPVPH EAEFYIGVAG
LAPPQNLALL FQVVDGSADP LSIKPKPHID WSYLQQNEWV PFGHGEVEDQ TGELIDSGIV
TLSIPRGASS DNTLLPAELH WVRAAVKSNS DSVCRLILVA AQALQATFED QGNDPAFVGG
MLPAGSIAKL AQPDADVKKA SQPFAGFGGR GKEAPADFYT RVSERLRHKD RAIAQWDYER
LVLEGFPQIY RVKCLNHTQY EPDATGSGIY RELAPGHVTV VTIPDLRFAA LRDPLRPYTS
LGLLDKISAY LAQRLSCFVR LHVRNPLFEE VQVDCKVTLQ PGLDQSFYEL KVKEAITRFL
SPWAFPGGGN PSFGGKVRKS VLIDFVEELP YVECVMDVRL LHSYIDAYGF ARTDEVDEAA
GSTAVSILVS ARKHLVATID PAEEETPGEL CRCPA