Gene TM1040_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2054 
Symbol 
ID4077981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2156958 
End bp2158784 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content62% 
IMG OID638007373 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_614048 
Protein GI99081894 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCC CAACCCGACA CCCGACCCCC AACGTCACAA CGGGCGCCCT GCCCGCCTCC 
CGTAAGATCT ATGTGGCAGG CGAGATGCAT GAAGGCATCC GCGTGCCGAT GCGCGAAATC
GCCACCCACC CGACTGCGGG CGAGGCGCCG CTACCGGTTT ATGACAGCTC CGGCCCCTAT
ACCGACCCCG ATGTGTCAAC CGACATCCGC GAGGGATTGG CCCCGTTGCG CGCGGCGTGG
ATCAAGGCCC GCTGCGATGT TGAGGCCTAT AGTGGCCGCG ATGTGACCGC TGCAGACAAT
GGCTTTGTCG AAGGCGATCG GCTGGTGCCT GAGTTTCCTG TAAAACCGGC GCCCCTGCGC
GGGAGGCAGC GGGCCGCTCC GACTCAGCTG GCCTATGCGC GCGCAGGAAT CATTACGCCC
GAGATGGAAT TCGTGGCGAT CCGAGAAAAC CAATTACGCG ATCTGTCGCC CTGTCATTGT
GCGCGCGACG GGGAGGATTT TGGCGCCAAT ATCCCCGACT ACGTGACGCC AGAGTTTGTC
CGCGCTGAAA TCGCCGCAGG GCGCGCCATC ATCCCCGCCA ATATCAACCA CCCGGAGCTG
GAGCCGATGA TCATCGGCCG CAATTTCAAG GTGAAGATCA ACGCCAATAT GGGCACCTCT
TCGGTGACCT CCTCGATGGA GGAGGAAGTG GACAAGCTGG TCTGGGCGAT CCGCTGGGGC
GCAGATACGG TGATGGATCT CTCGACGGGC CGCAACATCC ACAACACCCG CGAATGGATC
ATCCGCAACA GCCCGGTGCC CATCGGCACC GTGCCGATCT ATCAGGCGCT GGAGAAGGTG
AACGGCATCG CCGAGGATCT GACATGGGAG GTGTTTCGCG ACACGCTGAT CGAGCAGGCG
GAACAGGGCG TCGATTACTT TACCATTCAT GCGGGCGTCC GGCTGCATAT GGTGCCGATG
ACCGTGGAAC GCGTGACGGG GATCGTGTCG CGCGGTGGGT CGATCATGGC GAAATGGTGC
CTGCATCACC ATCGCGAGAG CTTCCTCTAT GAGCACTTCG AGGAGATCTG CGACATCTGT
CGGCAGTATG ACGTGAGCTT CTCGCTGGGT GATGGCTTGC GTCCCGGGTC GATTGCCGAT
GCCAATGACG CCGCGCAATT TGCCGAGCTG GAAACGCTGG GCGAGCTCAC AAAAATCGCC
TGGGCCAAGG ACTGTCAGGT GATGATCGAG GGGCCGGGCC ATGTGGCGAT GCACAAGATC
AAGGAGAACA TGGACAAGCA GCTGGAGTGC TGCCACGAGG CGCCGTTTTA CACGCTCGGC
CCGCTCACCA CGGATATTGC GCCGGGGTAC GATCACATCA CCTCCGGGAT TGGCGCGGCG
ATGATCGGCT GGTTTGGCTG CGCGATGCTG TGTTACGTGA CGCCCAAGGA ACATCTGGGC
CTGCCGGACC GCGATGACGT TAAAACCGGG GTGATCACCT ACAAGATCGC CGCCCATGCC
GCTGATCTGG CCAAGGGTCT GCCCGGCGCG CAGCGGCGCG ATGATGCGCT GAGCCGCGCG
CGGTTTGAAT TCCGCTGGGA GGATCAGTTC AACCTCTCGC TCGACCCGGA GACCGCGCAG
AGCTTCCACG ACGAGACCCT GCCCAAGGAG GCCCACAAGG TCGCGCATTT CTGTTCCATG
TGCGGGCCAA AGTTCTGCTC CATGCGGATC AGCCACGACA TCCGCGCCGA AGCCCAGAAA
GAAGGGTTTG AGGCGATGGC GGCAAAATTC CGCGAAGGCG GTGAACTCTA TGTACCACTG
AAGGACACCG CACCAGAAGA GGCCTGA
 
Protein sequence
MNTPTRHPTP NVTTGALPAS RKIYVAGEMH EGIRVPMREI ATHPTAGEAP LPVYDSSGPY 
TDPDVSTDIR EGLAPLRAAW IKARCDVEAY SGRDVTAADN GFVEGDRLVP EFPVKPAPLR
GRQRAAPTQL AYARAGIITP EMEFVAIREN QLRDLSPCHC ARDGEDFGAN IPDYVTPEFV
RAEIAAGRAI IPANINHPEL EPMIIGRNFK VKINANMGTS SVTSSMEEEV DKLVWAIRWG
ADTVMDLSTG RNIHNTREWI IRNSPVPIGT VPIYQALEKV NGIAEDLTWE VFRDTLIEQA
EQGVDYFTIH AGVRLHMVPM TVERVTGIVS RGGSIMAKWC LHHHRESFLY EHFEEICDIC
RQYDVSFSLG DGLRPGSIAD ANDAAQFAEL ETLGELTKIA WAKDCQVMIE GPGHVAMHKI
KENMDKQLEC CHEAPFYTLG PLTTDIAPGY DHITSGIGAA MIGWFGCAML CYVTPKEHLG
LPDRDDVKTG VITYKIAAHA ADLAKGLPGA QRRDDALSRA RFEFRWEDQF NLSLDPETAQ
SFHDETLPKE AHKVAHFCSM CGPKFCSMRI SHDIRAEAQK EGFEAMAAKF REGGELYVPL
KDTAPEEA