Gene GM21_2605 details


Gene Information

Locus tag: GM21_2605
Symbol:
ID: 8137947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3037590
End bp: 3038744
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 53%
IMG OID: 644870211
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_003022401
Protein GI: 253701212
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
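
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes a single terminal stop codon. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3037590
end_bp = 3038744
gene_length_bp = 1155
protein_length_aa = 384

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 1155 385 384
```

Under these assumptions the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.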


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 138
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTTGATC TCCTTCTTGT CCTAGCTAAG AACTGGAAGA TGATTGTAGG AGTTCCTTTT 
GTCGTAGCAG TCATCACCGG TATTTGCACA CTGTTCATGC CAAACATTTA TACCGCCAAA
GTCATGATCC TGCCTGGAGA TAACAGTAGC GGAGTGATGA GTACGATGTT GGCCCAGATG
GGTGGTCTGG CCGGTCTCGC CGGCGGCTTA GGAGGTACCA CCAAAGCTGA TCTTTATGTG
ACAATGCTCA AAAGTGAAAC GCTTAAAGAC CCGCTCATAG ACCGCTTTAA GCTGATGGAG
CTGTATGAAA CCAAGTTTCG CGCCAATGCC TATAAAGCGA TGGACGGTAA TGCGGCTATC
AGCACCGGGA AAAAAGACGG GATTATCACC ATTGCGATCT CAGACAAGGA TCCAAAACTG
GCAGCGGCCA TAGCCAATGC CTATGTAAAC GAGCTGGGCA AGATGGCGGC AAAGCTGGAT
ATGGCCGGCG CTGGCATGAG CAGGGTTTTC CTGGAAGAGC GTCTCACCAA GGCCAAGGTA
GATCTGGCTG CGGCGGAGGA GACGCTTAAG TCGTTCCAGA CCAAAAACAA AGTTGTGGCA
GTCACCGATC AGGCCAAGGC GACACTGGTA GGGGCGGCGC AGCTGCGGGC CCAGTTGGTT
GCCCAGGAAG TGCAGATGGC AACCATGCGG CAGCAGTTCA CCGATGAAAG TCACGAACTC
AAATCGATCA AGGCAACCAT TGTCAGTCTG CGCGGACAGA TTGCCCGTCT GGAAGGGAGC
GGTTCGACAG GTTCCATGCC CGGTGTCGGG GCTATGCCGC AGTTGGAGCA GGAATATATA
CGACTGATGC GGGAGTTCAA GGTCCAGGAA ACACTGGTGG AACTGCTGAC CAAACAGTAT
GAGATGACCA AACTGAACGA GGCCAAGGAT GTGGTGCCGT TTCAGATTTT GCAGTTGGCC
AAGGTGCCGG AGCTGAAGAG TAAACCAAAA CGGAGCTCGA TTGTGATCAT AGCAGCCTTT
GCCAGCGGTT TTCTGATGGT GCTGACGGCT TTCGTGCGCG AATTTGGGGC GAAGATGAAT
GATGAGGATC GTACGCGGTG GCAGGAACTG CAAAGAGTGC TGCCGCTGCC GCGTCGTTCT
AGGAATGAAG AGTAG
 
Protein sequence
MLDLLLVLAK NWKMIVGVPF VVAVITGICT LFMPNIYTAK VMILPGDNSS GVMSTMLAQM 
GGLAGLAGGL GGTTKADLYV TMLKSETLKD PLIDRFKLME LYETKFRANA YKAMDGNAAI
STGKKDGIIT IAISDKDPKL AAAIANAYVN ELGKMAAKLD MAGAGMSRVF LEERLTKAKV
DLAAAEETLK SFQTKNKVVA VTDQAKATLV GAAQLRAQLV AQEVQMATMR QQFTDESHEL
KSIKATIVSL RGQIARLEGS GSTGSMPGVG AMPQLEQEYI RLMREFKVQE TLVELLTKQY
EMTKLNEAKD VVPFQILQLA KVPELKSKPK RSSIVIIAAF ASGFLMVLTA FVREFGAKMN
DEDRTRWQEL QRVLPLPRRS RNEE
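
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step, not the pipeline used to produce this record. The `translate` helper and the `START_CODONS` set are hypothetical names, and the set is a simplified subset of the table 11 initiation codons.

```python
from itertools import product

# Standard codon-to-amino-acid mapping with bases ordered T, C, A, G.
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: simplified subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record.
first_line = "TTGCTTGATC TCCTTCTTGT CCTAGCTAAG AACTGGAAGA TGATTGTAGG AGTTCCTTTT"
print(translate(first_line))  # prints: MLDLLLVLAKNWKMIVGVPF
```

The output matches the first 20 residues of the protein sequence. Translating the final line of the gene sequence on its own, AGGAATGAAG AGTAG, gives RNEE followed by the stop codon, which matches the end of the protein.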