Gene TM1040_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1111 
Symbol 
ID4077818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1192007 
End bp1193008 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content63% 
IMG OID638006415 
Productglyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_613106 
Protein GI99080952 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCA AAGTTGGAAT CAATGGCTTC GGACGTATCG GCCGCTGCAC CCTCGCCCAT 
ATCGCGGGCA GCTTTCGCAA CGATATCGAG GTGATCAAGG TCAATGCGAC CGGCCCGATC
GAGACGGCAG CGCATCTCAT CAAATACGAC AGCGTGCATG GCCGTTTCCC CGGTCAGGTC
GATTTCAGCG AAGGCCGCCT GAACCTTGGC CGCGGCGACA TGCAGGTGTT TTCCACCTAC
GACATGGACA CGCTGGACTG GGAAGGCTGC GATGTCGTGC TGGAATGCAC CGGCAAATTC
AACGATGGCC TGAAGGCGAA GAAGCACCTC GAGCGCGGCG CAGGCAAGGT TCTGCTGTCC
GCACCTGGCA AGAACGTGGA CCGCACCGTG GTTTACGGCG TCAACGACGA GGATCTTCTG
TCCACCGACA AGATGGTCTC CAACGGGTCT TGCACCACCA ACTGCCTTGC ACCGCTGGCC
AAGGTACTGG ACGAGGCCTT TGGCATCGAG CATGGCATCA TGACCACCAT CCACGCCTAC
ACCGGCGATC AGCCGACGCT AGACCGCCGT CACGACGATC TCTACCGCGC CCGTGCGGCA
GCGATGTCGA TGATCCCGAC CTCTACGGGC GCGGCCAAGG CGCTGGGTGA GGTTCTGCCG
AACCTCAAGG GCCGTCTCGA TGGCTCAGCG ATCCGCGTGC CGACCCCGAA TGTGTCCGCG
GTGGACCTGA CCTTCCGCGC GGGGCGCGAG GTCACCGCCG AAGAGGTGAA CGCCGCCGTG
GCAGAGGCGT CCAAGGGCCA TATGTCGCGC GTTCTGGGCT ATGAGCCCGC GCCGCTGGTC
TCGACCGACT TCAACCACAC CGAAGAAAGC TCCATCTTTG CGCCCGACCA GACCCGCGTG
GTCGACGGTC GCATGGTGCG CGTGCTGGCG TGGTATGACA ACGAATGGGG CTTCTCGGTC
CGGATGGCCG ATGTTGCCAC CGCCATGGGT CGCCTGAACT AA
 
Protein sequence
MTIKVGINGF GRIGRCTLAH IAGSFRNDIE VIKVNATGPI ETAAHLIKYD SVHGRFPGQV 
DFSEGRLNLG RGDMQVFSTY DMDTLDWEGC DVVLECTGKF NDGLKAKKHL ERGAGKVLLS
APGKNVDRTV VYGVNDEDLL STDKMVSNGS CTTNCLAPLA KVLDEAFGIE HGIMTTIHAY
TGDQPTLDRR HDDLYRARAA AMSMIPTSTG AAKALGEVLP NLKGRLDGSA IRVPTPNVSA
VDLTFRAGRE VTAEEVNAAV AEASKGHMSR VLGYEPAPLV STDFNHTEES SIFAPDQTRV
VDGRMVRVLA WYDNEWGFSV RMADVATAMG RLN