Gene GM21_3411 details


Gene Information

Locus tag: GM21_3411
Symbol:
ID: 8138778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3942713
End bp: 3943819
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 68%
IMG OID: 644871028
Product: lipopolysaccharide heptosyltransferase II
Protein accession: YP_003023193
Protein GI: 253702004
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
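
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions about how the record is laid out, not statements from the source. The variable names are illustrative only.

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 3942713
end_bp = 3943819

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the 1107 bp and 368 aa reported in the record.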


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 143
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGG CCGCCATGAC TGACAACCGT ACGGGGAAAA GGATCCTGGT GCTGCGCTAC 
CGCTTCATCG GCGACACCAT CCTCACCGTC CCGTTTCTGC GGAACCTGCG CCGCGCCGAG
CCGGACGCCT ACATCGCCTG GGTCGTCGCC CCCGGCTCCT CCGAGGTGAT CCAGGGGACA
CCCTACGTCG ACGAGCTCAT CTTCTGGGAT CCCCCCACCA TCCATGCCGA CAGCCGCTCC
ACCCACAAGA CGCTAGGCGA CAAGTTAGGC TTCATCAGGG AGCTGCGCGC CCGCCGCTTC
GACAAGGTCT ACGTGCTCAA GCGCTCCTTC GGCAGCGCCA TCATTGGCCT TCTCTCCGGC
GCCTCGAAAC GGATCGGCTT CGCCACCGAG GGGCGGAACT TCCTGTTGAC CAAGGGGGTC
CCCTACCGGC ACGGGCAGCA CGAGGTGCAA AACTTCCTGG ACGTCTTGCG CGCCGACGGC
GTGCCGGTCG TGGACGATCA TCTCGAGGCG TGGCTTTCGG CCGAGGAGAA GGCTTTCGCG
GACGACTTCT TCCGGCAGCG CGGCGTCTCC GCGGACGAGC TGGTGATCGG GATGCACCCC
TTCGCCGCCA ACCCGCCGCG CGCCTGGCAC CTGGACAACT TCACCGAACT GGCGCGCGCC
CTGCAAAAGC GCTATCGCTG CCGGATCATG TTCTTCGGCG GCCCCCGGGA CAAGGAGGCG
CTCGACGCGA TACGCGGCGG GCTGGACGTG CCGCCCCTTG AGGCGGTCGG CTCGACCACG
CTGCGCCAGA CCATGGCCCT TCTCTCCCGC TGCGGCGCCC TTGTCTGCAA CGACAGCGGC
ATCATGCATC TCGCCGCCTC GCTGCAGGTG CCGCTGGTCG CGCTTTTCGG CCCGCAGTCG
CCGGTCAAGT TCGGCCCCTG GGGGACCGCG TGCCGCGTGG TGCGCCACGA CTTCCCCTGC
GGCCCATGCC GCCAGAGGTT CTTCACCGAG TGCGAGCCGT CGGAGCGCGG GAGGCCCGCC
TGCATCGAGG CGATCACGGT GGACGAAGTG CTGGCTGAAA TCGAAGCCCT GCTCGCGGCG
GGGGATAGGG AGCACACGGA TAGATGA
 
Protein sequence
MTKAAMTDNR TGKRILVLRY RFIGDTILTV PFLRNLRRAE PDAYIAWVVA PGSSEVIQGT 
PYVDELIFWD PPTIHADSRS THKTLGDKLG FIRELRARRF DKVYVLKRSF GSAIIGLLSG
ASKRIGFATE GRNFLLTKGV PYRHGQHEVQ NFLDVLRADG VPVVDDHLEA WLSAEEKAFA
DDFFRQRGVS ADELVIGMHP FAANPPRAWH LDNFTELARA LQKRYRCRIM FFGGPRDKEA
LDAIRGGLDV PPLEAVGSTT LRQTMALLSR CGALVCNDSG IMHLAASLQV PLVALFGPQS
PVKFGPWGTA CRVVRHDFPC GPCRQRFFTE CEPSERGRPA CIEAITVDEV LAEIEALLAA
GDREHTDR
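
The gene and protein sequences are printed in space-separated blocks of ten characters. A minimal sketch of how the blocks might be joined and the nucleotide sequence translated is given below. It builds a codon table from the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs mainly in which codons may act as starts, and this sketch does not model that. The function names `clean_sequence` and `translate` are hypothetical helpers, not part of any tool referenced by this record.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by table 11 for elongation).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()

def translate(text: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean_sequence(text)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence from the record.
print(translate("ATGACCAAGG CCGCCATGAC TGACAACCGT"))
print(translate("GGGGATAGGG AGCACACGGA TAGATGA"))
```

The two calls translate the first and last lines of the gene sequence, so their output can be compared with the start and end of the protein sequence listed in the record.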