Gene TM1040_3348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3348 
Symbol 
ID4075247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp359443 
End bp361311 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content58% 
IMG OID638004856 
Productthiamine pyrophosphate enzyme, central region 
Protein accessionYP_611582 
Protein GI99078324 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0194169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.142166 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCGC AAGACGACAC CATCCGCCTG ACCACAGCAC AGGCCATCAT CCGCTGGCTG 
TGCAACCAGT ACATTGAAAT CGATGGCGAA GAGATGCGCC TGTGTGGCGG TGGCTTCGGC
ATCTTTGGAC ATGGCAATGT GACCTGTCTG GGCGAGGCGT TGAACACCGT CCGCGATGCG
CTTCCGCTCT ATCGCGGACA GAATGAACAA AGCATGGGAT TTGCAGCTGC AGGATATGCC
AAGACCTGGT TACGGCAGCG GTTCATGTTC TGCACCGCAA GTGCGGGGCC GGGCACCGCC
AATCTCGTGA CGGCGGCAGG GCTGGCTCAT GCCAATCGCT TGCCGATGCT GATGCTCTGC
GGAGATACCT TCCTCACCCG CCTGCCGGAC CCAGTTCTGC AACAAATGGA AAACTTCAAC
GATCCGACCT TTGGCGTGAA CGACGCGTTC AAACCGGTCA GCCGCTTTTG GGATCGAATC
TGCCATCCAG CACAGGTGAT TCAATCGCTC CCGGCTGCGA TTGCAACCAT GCTTGATCCG
GCGGATTGCG GCCCGGCCTT TATCGGCCTG CCTCAGGACG TGCAGGGCTG GACCTATGAT
TATCCAACTA AATTCTTTGA CAAAAAGGTA CATCGCATTC GCCGTCAGGC ACCAGATGCA
GATGAAATCG CAGATGCGGT CGCCGCGTTA GCCGCGGCGG AGCGCCCCAT CCTGATTGCA
GGTGGTGGCG TCCAATACTC TCGCGCCGCA GAAACACTCC GTGAATTTGC CGAGACCCAC
CAGATCCCGG TGGTCGAAAC CATCGCGGGC CGCGCCAATA TGCGGGCGAC ACACCCCCTG
AACATCGGCC CGATTGGGGT GACAGGGTCA GACAGCGCAA ATGCAATCGC GGAAAAGGCC
GATGTGATCG TCGCCGTTGG CACCCGGTTG CAAGACTTCA CCACCGGATC CTGGACAGCC
TTTTCAAAGG ACGCTCAGTT TATCTCGATC AACGCCGCGC GCCATGATGC AGGCAAGCAT
ATGTCCTTGC CAGTTGTGGG CGATGCCAAG TTGAGCCTCG CCGCAATCGA GGCCGCAACA
GACTACAATG CACCCCTAGA TTGGGTCGCT TATGCGCAGG ATCAGCGCAG CAAGTGGGAC
CGATATGTGG TCGAGAACAC CGCCCACGGC AATCGCCCTA ATTCCTATGC CCAGGCGATC
GGCGTCGTGA ACGACCTCTG CGATCCGCGC GACCGCGTAG TGGCCGCTGC GGGTGGGTTG
CCGGCTGAGG TCACAGCAAA CTGGCGCACG CTTGATATCG GCACGGTGGA TGTGGAGTTT
GGCTTTTCAT GCATGGGGTA CGAGATTGCT GGCGCCTGGG GCGCGCGCAT TGCTCAGTCG
CAGATGGAAC CCGATCAGGA TGTCATCACT TTCTGCGGGG ATGGCTCTTA TATGATGCTG
AACTCGGACA TTTATTCCAG TGTCCTCTCA GGCAAAAAAA TGATTGTTCT CGTGCTCGAC
AACGGAGGCT TTGCGGTCAT CAATAAACTG CAAAACAACA CAGGAAATCA GAGCTTTAAC
AACCTTCTTG CGGATTGCCC GACCATTGCA GAACCCTTCG GCGTGGATTT TGTCGCCCAT
GCCGCCTCGA TGGGGGCGAT GGCCGAAAAA GTCGCCAACC CGGCAGAACT AGGCGAGGCA
TTCAAACGCG CGAAGGCTGC CGACAAGACC TATGTGATCG TAATGGATGT GGACCCCTAT
GAGGGATGGA CCACCCAAGG GCATGCATGG TGGGAAGTCG GAACGCCCCA TATCACCGAG
GATGAAGCCG TCAGGACTGC GCATCTGGAC TGGGAGTCCT CGCGCAGCAA ACAGCGAAAG
GGCATCTGA
 
Protein sequence
MSAQDDTIRL TTAQAIIRWL CNQYIEIDGE EMRLCGGGFG IFGHGNVTCL GEALNTVRDA 
LPLYRGQNEQ SMGFAAAGYA KTWLRQRFMF CTASAGPGTA NLVTAAGLAH ANRLPMLMLC
GDTFLTRLPD PVLQQMENFN DPTFGVNDAF KPVSRFWDRI CHPAQVIQSL PAAIATMLDP
ADCGPAFIGL PQDVQGWTYD YPTKFFDKKV HRIRRQAPDA DEIADAVAAL AAAERPILIA
GGGVQYSRAA ETLREFAETH QIPVVETIAG RANMRATHPL NIGPIGVTGS DSANAIAEKA
DVIVAVGTRL QDFTTGSWTA FSKDAQFISI NAARHDAGKH MSLPVVGDAK LSLAAIEAAT
DYNAPLDWVA YAQDQRSKWD RYVVENTAHG NRPNSYAQAI GVVNDLCDPR DRVVAAAGGL
PAEVTANWRT LDIGTVDVEF GFSCMGYEIA GAWGARIAQS QMEPDQDVIT FCGDGSYMML
NSDIYSSVLS GKKMIVLVLD NGGFAVINKL QNNTGNQSFN NLLADCPTIA EPFGVDFVAH
AASMGAMAEK VANPAELGEA FKRAKAADKT YVIVMDVDPY EGWTTQGHAW WEVGTPHITE
DEAVRTAHLD WESSRSKQRK GI