Gene GM21_2477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2477 
Symbol 
ID8137818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2896671 
End bp2898170 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content58% 
IMG OID644870087 
Productpolysaccharide chain length determinant protein, PEP-CTERM locus subfamily 
Protein accessionYP_003022278 
Protein GI253701089 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones136 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCGC CAGAGACCGA GTACAAAAAG TATCTGCAAC TGTTGTTCAG CAACAAGGAG 
CGATTTGTCG TCATCGCCCT GCTGCTGATG ACCGTGGCCT TCGTGGTCAG TTTCGTGCTT
CCGCGCAAGT ACCAGGCCAC GAGCACCGTA TTTATCGAGA AGAACGTGAT CAGCGAGCTG
GTCAAGGGGA TCACGGTGAC CCCCTCCATG GAAGACACCA TCAACGTCCT CACCTACGAG
ATCACCAGCC GCACCTTGCT CGCCAAGGTC GTGGACAACC TCGATCTGGA TCTGGGCAAG
AACGACAGCG AGACCGAGGA GCTGATCAAG CAGCTGCAGC TGAACACCAA GGTGAAGGTG
AAGGACAAGA ATCTCTTCAC CATCTCCTTC ACCCACACCA ATCCCAGCAT AGCCAGGGAC
TACGTCAACA CGCTCGTGCG CCTTTACATC GAAGGGAACA TCTCCTCCAA ACGCGGCGAG
TCCTATGACG CCACCAAATT CCTCTCCGAA CAGATCGACA CCTTCAACGA GAAGCTGCAA
AAGGCCGAGA ACGAGGTCAA CGCCTACAAG CGCGATAAGG GGGGGATCAT CGCCATCGAC
GAGGGGAAGC TCTTCGAGGA GATCAACATC GCGCAGCAGA AGCTCTACGA CCTCGAACTC
AGGCGCCGCC AGTTGGAAGG GATGCGCCAG ATCACCAAGA GGACCGGTGA CCCCCTGCAG
AACAGGCTCG CCGGGCTGCA GAAAAGGCTC GACGAACTGC TGGTCGAGTA CACCGAAAAT
TTCCCCGAGG TGGTGAAGGT CAAGGGGGAC ATCGAGACGG TGAAGGCTCA GCTGTCGGCG
CGCCGGGGTC AGCAGTCCCA GTCGCTCGAT CCCGCGGAGC TGGCCAAGAT CGAATCCGAA
ATCTCCGCGA TCAAAATCAC TGAAAGCGGC CTGAGGCGCT ACATAGACAC CAACCGTTCC
CTCTTGCAGA CCATCCCTTC GGCTAAGGCG GGACTCGAGA AGCTGGAGTT GGAGAAGAAA
AACCAGAAGA ACATCTACGA CCAGCTCTTT GCCCGTCACG GGCAGTCCGA GGTCTCCAAG
CAGATGGAGG TGCAGGACAA ATCCACCACC TTCCGCATCG TCGACCCGGC CCTGCTCCCG
GTCAAGCCTT CCAGTCCGGA TCGGCTGAAG CTGATGCTGC TGGGGATGGT GGGGGGGGTG
GCGGGAAGCT TCGCGCTGCT TTTCCTGATC GACCAGATGA ACAACACGGT GAAAGAGGTG
GAGTTCGTAA AGGGGCTGGG GGTGCCGGTC CTGGCGGTCA TTCCGAGGCT GCAGGATCCG
GAGGTCGAGG CGAAGAGGCG CAGGCGCTCG CGGCTGATCC TTGGGGGGGC GCTTATGTAC
TTCCTGGTGT TGATGGTTTT CCCCGGCATG GAACTCCTGG GGCTCCCTTA TATGGACAAG
GTGTTGGACT TATTGTCCGG GCCGGAAGCC CGGTTGCGGA TCAAGGGGCT TTTGCAGTGA
 
Protein sequence
MQSPETEYKK YLQLLFSNKE RFVVIALLLM TVAFVVSFVL PRKYQATSTV FIEKNVISEL 
VKGITVTPSM EDTINVLTYE ITSRTLLAKV VDNLDLDLGK NDSETEELIK QLQLNTKVKV
KDKNLFTISF THTNPSIARD YVNTLVRLYI EGNISSKRGE SYDATKFLSE QIDTFNEKLQ
KAENEVNAYK RDKGGIIAID EGKLFEEINI AQQKLYDLEL RRRQLEGMRQ ITKRTGDPLQ
NRLAGLQKRL DELLVEYTEN FPEVVKVKGD IETVKAQLSA RRGQQSQSLD PAELAKIESE
ISAIKITESG LRRYIDTNRS LLQTIPSAKA GLEKLELEKK NQKNIYDQLF ARHGQSEVSK
QMEVQDKSTT FRIVDPALLP VKPSSPDRLK LMLLGMVGGV AGSFALLFLI DQMNNTVKEV
EFVKGLGVPV LAVIPRLQDP EVEAKRRRRS RLILGGALMY FLVLMVFPGM ELLGLPYMDK
VLDLLSGPEA RLRIKGLLQ