Gene GM21_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3404 
Symbol 
ID8138771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3935621 
End bp3936676 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content66% 
IMG OID644871021 
Productlipopolysaccharide heptosyltransferase I 
Protein accessionYP_003023186 
Protein GI253701997 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02193] lipopolysaccharide heptosyltransferase I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones133 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGTTC TGATTGTCAA GATGTCCGCC TTGGGGGACG TGATACACGC CCTGCCGGTG 
CTGGATTATC TGACTCAAAC GGTGAAGGGG ATCGAGATCG ACTGGGTGGT CGAGGAGGCT
TTCCGGGACA TGCTCTCCGG CAACCCGCTC ATCTCCCGGC TGCACCTGGC CCGTTTCAAG
GCGTGGCGCA AGAAGCCCTT CGCTCCGGCC ACGCTCAGGG AGGTGAACGC CCTGGCGAAT
GCGCTCAAGG AGCGCGAGTA CGACATCGTC TTCGACCTGC AGGGGAACAT CAAGAGCGGC
ATCGTCACCA GGATCACCGG CTGCCCCAGG CGCTACGGCT TTGACCGGGA GGGGGTGCGC
GAGAGCCTCA ACGTCTACTG CACCACGAAC CAGATACCGC TCAGGCGCGC GGACCAGCAC
GTGAACGACC GGGCGCTGCG GGTGGTGAGC GTACCCTTCG GCAAGAACTA CCAGGGGATG
CAGCTTGCCA CGGATATCTA CACCCCGCCG GAGGACGATG CGGCGGCCGA GGCGTTTCTG
GCGACGCTCT CCGACGGGCT GGTCTTCGTG CTGCACCACG GCACCACCTG GAGCACCAAG
CACTGGCACC AGGAGGGGTG GATCTCGCTG GGGCAGGAGC TTTTGACGCT CTACCCGGAG
GCCACCATCC TCCTTTCCTG GAGTGGCGAG ACCGAGCACG AGGGTGCCAA GGAGATCGCC
GCAGGGATCG GGAGCCAGGT GCGGGTGCTT CCCAAGCTCA CCCTGAAGGG GTTCAGCGCG
CTCTTGAAAA AGGTCGACCT GGTCCTTGGC GGGGATACCG GTCCCATCCA CATCGCCGCC
GCCGTCGGGA CCCCCACGGT CAGCCTGTAC CGAGCCACCG ACGGGGCCCG CAACGCGCCC
AGGGGAGAGC ACCGGGCGGT GCAGTCACCG CTTTCCTGCG CCAAGTGCCT GCGCCGCTCC
TGCGACCGGG ACGACGAGTG CCGCCGGAGC ATCCAGGTGA AGGCCATGCT GCAGGCCTGC
CGGGAACTGC TGAGTAATAC GAGTACCCCG CTTTAG
 
Protein sequence
MRVLIVKMSA LGDVIHALPV LDYLTQTVKG IEIDWVVEEA FRDMLSGNPL ISRLHLARFK 
AWRKKPFAPA TLREVNALAN ALKEREYDIV FDLQGNIKSG IVTRITGCPR RYGFDREGVR
ESLNVYCTTN QIPLRRADQH VNDRALRVVS VPFGKNYQGM QLATDIYTPP EDDAAAEAFL
ATLSDGLVFV LHHGTTWSTK HWHQEGWISL GQELLTLYPE ATILLSWSGE TEHEGAKEIA
AGIGSQVRVL PKLTLKGFSA LLKKVDLVLG GDTGPIHIAA AVGTPTVSLY RATDGARNAP
RGEHRAVQSP LSCAKCLRRS CDRDDECRRS IQVKAMLQAC RELLSNTSTP L