Gene GM21_1145 details

Gene Information

Locus tag: GM21_1145
Symbol:
ID: 8136467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1335572
End bp: 1337722
Gene Length: 2151 bp
Protein Length: 716 aa
Translation table: 11
GC content: 61%
IMG OID: 644868756
Product: hypothetical protein
Protein accession: YP_003020964
Protein GI: 253699775
COG category:
COG ID:
TIGRFAM ID:
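
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it matches the reported gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1335572
END_BP = 1337722


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2151, matching "Gene Length"
print(protein_length(length_bp))  # 716, matching "Protein Length"
```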


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 111
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAACA AGCATCTTGA TCTTGTATCT ACCGTAATCT CGCCTCTTAT CTTGCTGCTG 
CTGGCAGCCG GCAGTGTCTG CGCGGCAGAG GGGGGCATCG TCGAGGTGCC GGTGGCGGAA
GCAGACCTGA TGGAAGCTCC GGCTGAGCCG GAGCCGCAGC TGCTGGTGGA CCTGACCCCG
CTCCCGGCCG AGTACGACCG GCAGGTCGAG GTGCTGGAGC AAAAGGGAGG ATCGCTTGGC
TACAACTTCC TTTTCAAGGA CGGGCCCGCC GGGCGTGCCC TCGAGTACGG CTTCCTCGAT
TCCAGCCGCA CTGGCGGCGT TTTTTACCGC CACATGGAAA AGGACCGCAA CCTCGAGCTG
GACGGGTTCT ACCTCAACGA GAACGACTAT CACGGCGACC TGATCCTGGA CTATCGCGGC
GATTACCGCC TGCATTTGAG GACCGAGTCC TTTTACCACA ACCTGGATCG CGAAATTCTG
TTCTCCACTC GGTTTCAGTC GGCGTCGACC AACGGTGGTC TGGCGAGCTA CGAGCCCGAA
CAGCAGGACA TCGGCGCAAG CTACGGCGTA AGCGTGGTGC AGGACCGGGC CGAGTTCCGC
TACCGGTTGC ACGACTTCCC ACTGCACCTG AATCTCGGCT ACTGGCGCTA CCAAAGGGAA
GGTACCACGC AGCAGATTTT CGCTGACACC TCGTTCGAGG GGACGGAGAA CCGGGTGTTC
GCCCAGGCAA GGCCGGTCAA TCAGCAGGTT CAGGAAGGGC GCTTGGGTTT CGACGCCCAT
CTCGGCCCAG TGGACGTCAT CTACGAGTTC AAGTTCCGCA GCTTCGAGGA TAAGCTCTCC
ACCCCGGTTG AGACCTTTGT GGCACGTCCC GATCTGGCCG GAAATCCCCT CATCCTGGCA
GGGGTGCAGC AGCACAACGA AAACCCCGAT AGCCGTTTCT ACTCCCATAC GATAAAACTA
CACAGCTCCC TGTCCGGCGG GCTCGTGACC GGCGCCTCCT ACGCCGTTGA ACAGCGGGAA
AACCTTTCCA AGCTGACCGA TACCGTCGGG GTACAGCACG CAAGGACCTA CCTGCAAAAT
GCCGCCGGCG ACTTCGTCTA CACGCCCAAC CGCGAGTACA CCTTCTCCGT CAAGTACCGC
CGCCAAGAGG TGGATCACGG CAACCGCGGC GCGGTCCTAA GCTCCAACCT GCTTCTTCCC
GCATCGCCGT CGCGCCCCCC CGTCGACACT GTGAAGGATG TCATCATCGC CTCGGTTTCC
CACAAGCCGC AGCTAAACCT CTCGCTGGTG GGCGAGTACC GCGGAGAGTT CCTGCAGAGA
AAGCACGTCT CCCTGTTCCC GTCAGAAGAG AGTTGGGCCC TGCCGGAGAA TTCCAACACC
AACACTGGAT CGCTCGCGGC CCACTACCGC CCGGTCAAGG GGCTGCGTAC CAGCGCCCAT
CTGTCCTACG CCACGACGGA CCACCCCTCC TACGGGGCGT CGTTTAACGA AAGGCGCGAG
GGAAAGCTTC TGGCCAATTA CACGCGCAGC AACGTCTGGG GATTGACGGC CCACGCCATG
TTCAGGCATG ACAAGAACGA CGAGGTCCCG CACTACCTGA TTCTGTTGGA CCCGGCGAAC
CCCGAACCGG TAAGCTATAC CACTTCTCCC TTCACCTACC GGGACAAGCG TAGCGAAAAC
TCGAACATAG GCGCATGGAT AGTTCCGGTA CACAGGCTGA CCCTGGGGAT GAACTACTCC
TACCTACGCG ACAAGATCGA TCAGCCGGTG CTCTTCACCG GTGTCTTTGT CCCCAGTCAA
GCCGGGGCCA AGTATTTGAA CCGGGCTCAT GTCTATTCCC TCAACGCCTC CTTTGCGGCG
ACGGAAAAGC TCGACCTCGC GCTTTTACTG CAGCAAAGCC GGTCGAGCTC GACGTTCACC
CCGGACGCGA TGGAGTTCAC GGGTCTGATC CCGGGGAGCA CCGCGGGGAT CGGCGAGCTG
AGCGCCTTCA GTGCCGTGCA GCGCACCGTT GCGGCCCGGG GGGAGTATCG TTTCAACGAG
GTGCTGTCGA CCTCGCTTGA GTACACCTTC AGGGATTACG ACGACAAGAA AGACGATAGC
AGCGACGGCT CGGCTCATGC GATAGTAGCC GTGGTCGCGG CCAAGTGGTA A
 
Protein sequence
MVNKHLDLVS TVISPLILLL LAAGSVCAAE GGIVEVPVAE ADLMEAPAEP EPQLLVDLTP 
LPAEYDRQVE VLEQKGGSLG YNFLFKDGPA GRALEYGFLD SSRTGGVFYR HMEKDRNLEL
DGFYLNENDY HGDLILDYRG DYRLHLRTES FYHNLDREIL FSTRFQSAST NGGLASYEPE
QQDIGASYGV SVVQDRAEFR YRLHDFPLHL NLGYWRYQRE GTTQQIFADT SFEGTENRVF
AQARPVNQQV QEGRLGFDAH LGPVDVIYEF KFRSFEDKLS TPVETFVARP DLAGNPLILA
GVQQHNENPD SRFYSHTIKL HSSLSGGLVT GASYAVEQRE NLSKLTDTVG VQHARTYLQN
AAGDFVYTPN REYTFSVKYR RQEVDHGNRG AVLSSNLLLP ASPSRPPVDT VKDVIIASVS
HKPQLNLSLV GEYRGEFLQR KHVSLFPSEE SWALPENSNT NTGSLAAHYR PVKGLRTSAH
LSYATTDHPS YGASFNERRE GKLLANYTRS NVWGLTAHAM FRHDKNDEVP HYLILLDPAN
PEPVSYTTSP FTYRDKRSEN SNIGAWIVPV HRLTLGMNYS YLRDKIDQPV LFTGVFVPSQ
AGAKYLNRAH VYSLNASFAA TEKLDLALLL QQSRSSSTFT PDAMEFTGLI PGSTAGIGEL
SAFSAVQRTV AARGEYRFNE VLSTSLEYTF RDYDDKKDDS SDGSAHAIVA VVAAKW
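
The protein sequence can be re-derived from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to produce this record. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs only in the set of permitted start codons, and this gene begins with ATG, so the distinction does not matter here. The names `CODON_TABLE` and `translate` are hypothetical. Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 nt of the gene sequence, spaced as printed in this record.
print(translate("ATGGTGAACA AGCATCTTGA TCTTGTATCT"))  # MVNKHLDLVS
```

Passing the full gene sequence to `translate` in the same way should give a string that can be compared against the protein sequence once its spaces are removed.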