Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2118 |
Symbol | |
ID | 4029263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2386882 |
End bp | 2388372 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637967319 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_574168 |
Protein GI | 92114240 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTATC GCGTCAAGCT GTTCCCCGAG ATCACCATCA AGTCAAGGCC CGTGCGCAAG GAGATGGTCC GCTGCCTGCG CACCAACCTG CGCAACAGCC TTGCGCCCCT GGTGCCCGAT GTACGCATCG CCGACTGCTG GGATGCGCTG GAGGTACGCA TTCGCCGCCC CATCGACGAG ACGACCCGCG AACGGGTGGA AGGCGTCTTG TCCCGCACCT CCGGCATTCA CGAGACGCTG GTGATCGAGG CCTATACGTT CACCTCTTTC GATGACACCG CCGCACACCT GGCGGCGTTA TGGCAGGAGG CACTGGCAGG CCAGCGCTTT CGGGTCAGCG TCAAGCGCCG AGGACAACAT GACTTCACCT CGGCGGAGCT CGAGCGTTAC CTGGGAGGCG CCCTTCTTGC CGCCGCTCCG CAGGCCAAGG TGGATCTCAC GCAGCCCGAC GTGGATGTCA CTCTCGAACT GCAGGACCAG CATCTGCGCC TGGTCACACG GCGCCTTCCC GGCCTGGGCG GCTATCCGCT GGGTACCCAG GGTCAGGCGC TGGCGCTCAT CTCGGGCGGC TACGATTCCC CGGTAGCCGC CTGGCGCATG ATCCGCCGTG GCCTCAAGAC CCACTACCTG TTCTTCAATC TCGGCGGGGC CGCGCATGAA CGCACCGTAC GCGAGGTCGT CCATCGCGTG TGGCAAGGCT ATAGCGCCTC TCATCGCGTG CATTTCATCA GCGTGCCGTT CGAGGAGGTG CTCGACGAGA TCCAGCGACG CGTGCCTGCC GGCCTGGCGG GGGTCGTGCT CAAGCGGATG ATGCTGCGCG CCGCCAATCG CGTTGCGGCG CGGGCACGCA TTCCCGCCCT GATCACCGGG GATGCCCTGG CCCAGGTCTC CAGCCAGAGC GTCACCAACC TGGGGCTGAT CGATCGCGTC AGCGAGCGCC CCGTGCTGCG TCCGCTCATC GCCGATGACA AGCAGCGCAT CATCGAGGAT GCTCAGCGCA TCGGCACGGC CGAGCATGCC GAACGCCTGC CGGAATATTG CGGCACGATT TCACGGCGCC CCAACACGCG CCCCAAGCTC GCCCAGATCG AGGCCGCCGA GGCCGATTTC GACATGAGCG TCCTCGATGC CGCCCTGGAG GCCGCGAGTC GCACCCGGGT GGATCGCCTC CTCGATGCAC CGCCGTCACG TCCGCTGGAT GTTCCCGTGA TCGCCTCCGC CGACGCACTG CGCGCAGCCG AGGACGTCAC GGTGATCGAC ATTCGCCACC CCGATGAACG CGATGCCGCA CCGCTGAAGC TGCCGCATGG CGAGCCGCTG GCGATTCCGT TTTACGAACT CGCCGCACGC GCCGCGCAGC TTTCAGGCCG GCAACGCTAC GCGCTCTATT GCGCCCAGGG CGTGATGAGC AAGATGCAGG CGCTTCGGCT CGCCGACCAG GGCCTCGACA ACTTCCTCGT CTATCGGGAC GACGCTCACG CCGAGCGCTG A
|
Protein sequence | MRYRVKLFPE ITIKSRPVRK EMVRCLRTNL RNSLAPLVPD VRIADCWDAL EVRIRRPIDE TTRERVEGVL SRTSGIHETL VIEAYTFTSF DDTAAHLAAL WQEALAGQRF RVSVKRRGQH DFTSAELERY LGGALLAAAP QAKVDLTQPD VDVTLELQDQ HLRLVTRRLP GLGGYPLGTQ GQALALISGG YDSPVAAWRM IRRGLKTHYL FFNLGGAAHE RTVREVVHRV WQGYSASHRV HFISVPFEEV LDEIQRRVPA GLAGVVLKRM MLRAANRVAA RARIPALITG DALAQVSSQS VTNLGLIDRV SERPVLRPLI ADDKQRIIED AQRIGTAEHA ERLPEYCGTI SRRPNTRPKL AQIEAAEADF DMSVLDAALE AASRTRVDRL LDAPPSRPLD VPVIASADAL RAAEDVTVID IRHPDERDAA PLKLPHGEPL AIPFYELAAR AAQLSGRQRY ALYCAQGVMS KMQALRLADQ GLDNFLVYRD DAHAER
|
| |