Gene GM21_2231 details


Gene Information

Locus tag: GM21_2231
Symbol:
ID: 8137570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2602328
End bp: 2603443
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 66%
IMG OID: 644869846
Product: glycosyl transferase family 9
Protein accession: YP_003022038
Protein GI: 253700849
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
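
The start and end coordinates, the gene length, and the protein length in this record agree with one another. The sketch below shows the arithmetic. It assumes 1-based inclusive coordinates and a CDS length that includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(2602328, 2603443)
print(length, protein_length_aa(length))  # prints: 1116 371
```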


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.00858313
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGGGCGG TGCCCCAGCC AAGGTCTTGC CTTCGTGGCG TCTCCCTCAC CCTGTCCCTC 
CCCCAGAGGG GGAGGGGACC CGAAGTTGCC CGGAATTGCT CGAATAACCT TCCCAAAAGG
GGGGTGCCGG ATTCCTCGTT TCCCCCCCGA CGTTTCCTGG TCATCCGTCC CGGCGGCATC
GGCGACGCGG TCCTCCTGGT CCCGGCGTTG ACGGCGCTGC AAAAAGCTTT TCCCGGCTGC
CGCATAGACG TTCTCGCCGA AAGCCGCAAC GCCGCCGCCT TTCTCATGTG CCCCGGGCTG
AACTGGGTGT ACCGCTACGA TTGTCTTTCC GACATGGCGG CTTTGCTCCG CACCCCTTTC
GACGTGGTGA TCGACACCGA GCAGTGGTAC CGCCTCTCCG CGGTCATCGC CAGGGTGGTC
CGCGCCCGGC GCTCCATCGG TTTTTGCAGC AACGAAAGGG GGAGGCTCTT CACCGACCCC
GTGCCTTACC CCTTGCAGGA TTACGAACTC CTCTCCTTCT TCAAGCTCCT AGCCCCGCTC
AAGGTGCAGC CTCCCCCGGA ACTGCCGGCT CCCTTTCTTG AACTCCCCGC CGGGGCGAAG
GAAGGGGCGC GGCGACTTTT GGCCCCGCTG GCCGGACAGA AATTCGTCGC CATCTTCCCC
GGAGCGAGCG TTGCCGAGAA ACAATGGGGG AGGGAGAACT TCCGGCAGGT GGCGGAGAGC
CTTTTCGCGG CGGGGATCGC GGTGGTTGTA GTCGGCGCAG ACGACGCCCG CGCCTCGGGC
GACTTCATTG CCCGCGGCGG TCTTGCCCTG AACCTGGCGG GGAAGGGGGG GCTCATGGAA
AGCGCCGCCG TCCTCGCAAA GGCGCGAGTC CTATTAAGCG GCGACTCGGG GCTGTTGCAC
ATCGCCGCGG GGCTTGGCAC CGCGACCGTT TCGCTTTTCG GTGCCAGCGA CGCAGCCAAG
TGGGCTCCCA AGGGCGAACG GCACGCCGTA TTCAGTTCGT CGCTTTCCTG CGCCCCCTGC
TCCAGTTACG GAACCATCCG CTGCAGCGCG GGCGCCCGCT GTCTCGATGC CGCGCCGTCA
GAAGTGACCG CCGCGCTTTT GAGGCTGTGG GAATAG
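
The gene sequence is printed in blocks of ten bases, so it has to be joined before any computation such as the GC content reported in the gene information. The following is a minimal sketch under the assumption that only A, C, G, and T occur. `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unambiguous DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above; paste all rows for the full gene.
first_row = "GTGAGGGCGG TGCCCCAGCC AAGGTCTTGC CTTCGTGGCG TCTCCCTCAC CCTGTCCCTC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on all rows of the gene sequence gives a value that can be compared with the reported GC content.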
 
Protein sequence
MRAVPQPRSC LRGVSLTLSL PQRGRGPEVA RNCSNNLPKR GVPDSSFPPR RFLVIRPGGI 
GDAVLLVPAL TALQKAFPGC RIDVLAESRN AAAFLMCPGL NWVYRYDCLS DMAALLRTPF
DVVIDTEQWY RLSAVIARVV RARRSIGFCS NERGRLFTDP VPYPLQDYEL LSFFKLLAPL
KVQPPPELPA PFLELPAGAK EGARRLLAPL AGQKFVAIFP GASVAEKQWG RENFRQVAES
LFAAGIAVVV VGADDARASG DFIARGGLAL NLAGKGGLME SAAVLAKARV LLSGDSGLLH
IAAGLGTATV SLFGASDAAK WAPKGERHAV FSSSLSCAPC SSYGTIRCSA GARCLDAAPS
EVTAALLRLW E
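
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 uses the same codon-to-residue assignments as the standard code, and that the first codon (GTG in this record) acts as an initiation codon and is therefore read as M. `translate_cds` is an illustrative name, not a database or library function.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (T, C, A, G at each position).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(text.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; the first codon is assumed to be a start."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues:
        residues[0] = "M"  # assumption: initiation codon is read as Met
    protein = "".join(residues)
    # Drop a single trailing stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGGGCGG TGCCCCAGCC AAGGTCTTGC"))  # prints: MRAVPQPRSC
```

The printed residues match the first block of the protein sequence in this record.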