Gene Information |
Locus tag | TM1040_2054 |
Symbol | |
ID | 4077981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2156958 |
End bp | 2158784 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638007373 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_614048 |
Protein GI | 99081894 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
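
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2156958
end_bp = 2158784
gene_length_bp = 1827
protein_length_aa = 608

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length once the stop codon is excluded.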
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCC CAACCCGACA CCCGACCCCC AACGTCACAA CGGGCGCCCT GCCCGCCTCC CGTAAGATCT ATGTGGCAGG CGAGATGCAT GAAGGCATCC GCGTGCCGAT GCGCGAAATC GCCACCCACC CGACTGCGGG CGAGGCGCCG CTACCGGTTT ATGACAGCTC CGGCCCCTAT ACCGACCCCG ATGTGTCAAC CGACATCCGC GAGGGATTGG CCCCGTTGCG CGCGGCGTGG ATCAAGGCCC GCTGCGATGT TGAGGCCTAT AGTGGCCGCG ATGTGACCGC TGCAGACAAT GGCTTTGTCG AAGGCGATCG GCTGGTGCCT GAGTTTCCTG TAAAACCGGC GCCCCTGCGC GGGAGGCAGC GGGCCGCTCC GACTCAGCTG GCCTATGCGC GCGCAGGAAT CATTACGCCC GAGATGGAAT TCGTGGCGAT CCGAGAAAAC CAATTACGCG ATCTGTCGCC CTGTCATTGT GCGCGCGACG GGGAGGATTT TGGCGCCAAT ATCCCCGACT ACGTGACGCC AGAGTTTGTC CGCGCTGAAA TCGCCGCAGG GCGCGCCATC ATCCCCGCCA ATATCAACCA CCCGGAGCTG GAGCCGATGA TCATCGGCCG CAATTTCAAG GTGAAGATCA ACGCCAATAT GGGCACCTCT TCGGTGACCT CCTCGATGGA GGAGGAAGTG GACAAGCTGG TCTGGGCGAT CCGCTGGGGC GCAGATACGG TGATGGATCT CTCGACGGGC CGCAACATCC ACAACACCCG CGAATGGATC ATCCGCAACA GCCCGGTGCC CATCGGCACC GTGCCGATCT ATCAGGCGCT GGAGAAGGTG AACGGCATCG CCGAGGATCT GACATGGGAG GTGTTTCGCG ACACGCTGAT CGAGCAGGCG GAACAGGGCG TCGATTACTT TACCATTCAT GCGGGCGTCC GGCTGCATAT GGTGCCGATG ACCGTGGAAC GCGTGACGGG GATCGTGTCG CGCGGTGGGT CGATCATGGC GAAATGGTGC CTGCATCACC ATCGCGAGAG CTTCCTCTAT GAGCACTTCG AGGAGATCTG CGACATCTGT CGGCAGTATG ACGTGAGCTT CTCGCTGGGT GATGGCTTGC GTCCCGGGTC GATTGCCGAT GCCAATGACG CCGCGCAATT TGCCGAGCTG GAAACGCTGG GCGAGCTCAC AAAAATCGCC TGGGCCAAGG ACTGTCAGGT GATGATCGAG GGGCCGGGCC ATGTGGCGAT GCACAAGATC AAGGAGAACA TGGACAAGCA GCTGGAGTGC TGCCACGAGG CGCCGTTTTA CACGCTCGGC CCGCTCACCA CGGATATTGC GCCGGGGTAC GATCACATCA CCTCCGGGAT TGGCGCGGCG ATGATCGGCT GGTTTGGCTG CGCGATGCTG TGTTACGTGA CGCCCAAGGA ACATCTGGGC CTGCCGGACC GCGATGACGT TAAAACCGGG GTGATCACCT ACAAGATCGC CGCCCATGCC GCTGATCTGG CCAAGGGTCT GCCCGGCGCG CAGCGGCGCG ATGATGCGCT GAGCCGCGCG CGGTTTGAAT TCCGCTGGGA GGATCAGTTC AACCTCTCGC TCGACCCGGA GACCGCGCAG AGCTTCCACG ACGAGACCCT GCCCAAGGAG GCCCACAAGG TCGCGCATTT CTGTTCCATG TGCGGGCCAA AGTTCTGCTC CATGCGGATC AGCCACGACA TCCGCGCCGA AGCCCAGAAA GAAGGGTTTG AGGCGATGGC GGCAAAATTC CGCGAAGGCG GTGAACTCTA TGTACCACTG 
AAGGACACCG CACCAGAAGA GGCCTGA
|
Protein sequence | MNTPTRHPTP NVTTGALPAS RKIYVAGEMH EGIRVPMREI ATHPTAGEAP LPVYDSSGPY TDPDVSTDIR EGLAPLRAAW IKARCDVEAY SGRDVTAADN GFVEGDRLVP EFPVKPAPLR GRQRAAPTQL AYARAGIITP EMEFVAIREN QLRDLSPCHC ARDGEDFGAN IPDYVTPEFV RAEIAAGRAI IPANINHPEL EPMIIGRNFK VKINANMGTS SVTSSMEEEV DKLVWAIRWG ADTVMDLSTG RNIHNTREWI IRNSPVPIGT VPIYQALEKV NGIAEDLTWE VFRDTLIEQA EQGVDYFTIH AGVRLHMVPM TVERVTGIVS RGGSIMAKWC LHHHRESFLY EHFEEICDIC RQYDVSFSLG DGLRPGSIAD ANDAAQFAEL ETLGELTKIA WAKDCQVMIE GPGHVAMHKI KENMDKQLEC CHEAPFYTLG PLTTDIAPGY DHITSGIGAA MIGWFGCAML CYVTPKEHLG LPDRDDVKTG VITYKIAAHA ADLAKGLPGA QRRDDALSRA RFEFRWEDQF NLSLDPETAQ SFHDETLPKE AHKVAHFCSM CGPKFCSMRI SHDIRAEAQK EGFEAMAAKF REGGELYVPL KDTAPEEA
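
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last blocks of the gene sequence. This is a sketch, not the database's own pipeline: it assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence, copied from the record.
head = translate("ATGAACACCC CAACCCGACA CCCGACCCCC")
tail = translate("AAGGACACCG CACCAGAAGA GGCCTGA")
print(head, tail)
```

The two fragments translate to the first ten and last eight residues of the listed protein sequence. The last 27 bases are in frame because the 1800 bases preceding them are a multiple of three.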