Gene Information |
Locus tag | Rxyl_2806 |
Symbol | |
ID | 4117638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 2814913 |
End bp | 2816421 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 638037577 |
Product | thiamine biosynthesis protein |
Protein accession | YP_645531 |
Protein GI | 108805594 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
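The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration in Python, assuming the Start bp and End bp coordinates are 1-based and inclusive and that the final codon is the stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2814913
end_bp = 2816421

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
assert gene_length_bp % 3 == 0
codons = gene_length_bp // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the reported Gene Length (1509 bp) and Protein Length (502 aa).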
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGA CGGCCGAGAG GAAGACGGAG ACGGCCCGGG CGGCCGCGGG TCGCCCGGGG TTCCGGCGGG GGCTTCTGGT GCGGATGGGG GGTGAGATCT ACACCAAGTC CTCCAGGACG CGCCGGAGGT TCCTGCGGGT GCTGGTGAAC AACATCTACC TGGCGCTGCG CGAGAGCGGC ATCAGGGCCT CCATCCGGCC CGAGTGGAGC CGGGTGCTCG TCTACGCGGA CGACCTGCCC CGGGCGCGGG AGGTGCTCAC CCGGGTGTTC GGGGTCTACG CGGTGGCCGA GGCCCTCGAG GTGCCCTACG CCTCGCTCGA GGACCTGGTG GAGAAGGTCG CCCCCCTGTT CCGGGAGCAC GTGGCGGGCA AGACCTTCGC CGTGCGGGCC CGCCGGCGCC GGGCGCCGTT CACCTCCCAG GACGTGGGGC GCGAGCTGGG GGCAGCCCTG CTGCCCTTCT CGGCCGGCGT AGACCTGGAC GACCCCGAGG CCGAGGTGAG GCTGGAGGTC GGCAGGGACC GGGCCTTCGT CCTCCTGGAG GAGTCCAGGG GCCCCGGCGG CTTCCCCGCC GGCACCGGCG GGCGGGCGCT CGCCCTCTTC TCCGGGGGCT TCGACTCCCC GGTGGCCGCC TGGCGGGTGA TGCGGCGCGG CATCCGGGTC GACCTGGTGG TCTACGACCT GGGCGGCTGC GGCCAGGTCG GGCAGGCGCT CGCGGTGGCG CGGGAGCTCG CGCTGCGCTG GGCGCCGGGG CTGGAGGTGC GGGCCAACGT GGTCGACCTC GCCCCGGTCG TCTCCGCCCT GGTCAGGCGG GTCGACCCGC GGCTGCGCCA GATCCTGCTC AAGCGCGCCA TGTACCGCGC GGGCTCCATC CTGGCCGGGG AGCTGGGCTT CGAGGCGCTG GTGACCGGCG AGTCCCTGGG GCAGGTCTCC ACCCAGACCC TGCGCAACCT CGCGGTCGCC GAGGAGGCGG CGAGCGTGCC CGTGCTCCGG CCGCTGGTGG GCTCCGACAA GCAGGAGATC ATCGAGGCCG CCCGCAGCAT CGGCACCCAC GACGTCTCGG CGCTCGTCAA GGAGCACTGC TCCATCGCCA CCGGGCCGGT GGAGACCTGG GCCGACCCCG AGGAGGTGCT CACCGCCGAG AGCGGCCTCG ACCAGGACGT GGACGAGGCC TGGCTCCGCC GGGCGGTGGA GAACCGCCGG GTCATACGCC TGAAGAGCTG GGACCCCGCC GGGGAGGAGA GCCCGGGCTA CGTGGTGGAC CGGGTGCCCG AGGGGGCCGT GGTGGTGGAC ATCCGGGAGC CCGGGGAGGG CGAGCCCGTA GGCGATCTCC GGCTGCCCTT CTCGCGGGCC ATGGAGTCCC TCGAGGAGCT CGACCCGTCG CGCGAGTACC TGCTCGTGTG CGCCAGCGGG CGGCGCTCGG AGCTTTTGGC GCGCGAGATG ATCGGCCGCG GCTACAGGGC CTACAGCCTG GAGGGCGGGG CCGGGAGGCT GGCCGCCGCC CCCTCCTAG
|
Protein sequence | MTETAERKTE TARAAAGRPG FRRGLLVRMG GEIYTKSSRT RRRFLRVLVN NIYLALRESG IRASIRPEWS RVLVYADDLP RAREVLTRVF GVYAVAEALE VPYASLEDLV EKVAPLFREH VAGKTFAVRA RRRRAPFTSQ DVGRELGAAL LPFSAGVDLD DPEAEVRLEV GRDRAFVLLE ESRGPGGFPA GTGGRALALF SGGFDSPVAA WRVMRRGIRV DLVVYDLGGC GQVGQALAVA RELALRWAPG LEVRANVVDL APVVSALVRR VDPRLRQILL KRAMYRAGSI LAGELGFEAL VTGESLGQVS TQTLRNLAVA EEAASVPVLR PLVGSDKQEI IEAARSIGTH DVSALVKEHC SIATGPVETW ADPEEVLTAE SGLDQDVDEA WLRRAVENRR VIRLKSWDPA GEESPGYVVD RVPEGAVVVD IREPGEGEPV GDLRLPFSRA MESLEELDPS REYLLVCASG RRSELLAREM IGRGYRAYSL EGGAGRLAAA PS
|
| |