Gene Csal_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2118 
Symbol 
ID4029263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2386882 
End bp2388372 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content67% 
IMG OID637967319 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_574168 
Protein GI92114240 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTATC GCGTCAAGCT GTTCCCCGAG ATCACCATCA AGTCAAGGCC CGTGCGCAAG 
GAGATGGTCC GCTGCCTGCG CACCAACCTG CGCAACAGCC TTGCGCCCCT GGTGCCCGAT
GTACGCATCG CCGACTGCTG GGATGCGCTG GAGGTACGCA TTCGCCGCCC CATCGACGAG
ACGACCCGCG AACGGGTGGA AGGCGTCTTG TCCCGCACCT CCGGCATTCA CGAGACGCTG
GTGATCGAGG CCTATACGTT CACCTCTTTC GATGACACCG CCGCACACCT GGCGGCGTTA
TGGCAGGAGG CACTGGCAGG CCAGCGCTTT CGGGTCAGCG TCAAGCGCCG AGGACAACAT
GACTTCACCT CGGCGGAGCT CGAGCGTTAC CTGGGAGGCG CCCTTCTTGC CGCCGCTCCG
CAGGCCAAGG TGGATCTCAC GCAGCCCGAC GTGGATGTCA CTCTCGAACT GCAGGACCAG
CATCTGCGCC TGGTCACACG GCGCCTTCCC GGCCTGGGCG GCTATCCGCT GGGTACCCAG
GGTCAGGCGC TGGCGCTCAT CTCGGGCGGC TACGATTCCC CGGTAGCCGC CTGGCGCATG
ATCCGCCGTG GCCTCAAGAC CCACTACCTG TTCTTCAATC TCGGCGGGGC CGCGCATGAA
CGCACCGTAC GCGAGGTCGT CCATCGCGTG TGGCAAGGCT ATAGCGCCTC TCATCGCGTG
CATTTCATCA GCGTGCCGTT CGAGGAGGTG CTCGACGAGA TCCAGCGACG CGTGCCTGCC
GGCCTGGCGG GGGTCGTGCT CAAGCGGATG ATGCTGCGCG CCGCCAATCG CGTTGCGGCG
CGGGCACGCA TTCCCGCCCT GATCACCGGG GATGCCCTGG CCCAGGTCTC CAGCCAGAGC
GTCACCAACC TGGGGCTGAT CGATCGCGTC AGCGAGCGCC CCGTGCTGCG TCCGCTCATC
GCCGATGACA AGCAGCGCAT CATCGAGGAT GCTCAGCGCA TCGGCACGGC CGAGCATGCC
GAACGCCTGC CGGAATATTG CGGCACGATT TCACGGCGCC CCAACACGCG CCCCAAGCTC
GCCCAGATCG AGGCCGCCGA GGCCGATTTC GACATGAGCG TCCTCGATGC CGCCCTGGAG
GCCGCGAGTC GCACCCGGGT GGATCGCCTC CTCGATGCAC CGCCGTCACG TCCGCTGGAT
GTTCCCGTGA TCGCCTCCGC CGACGCACTG CGCGCAGCCG AGGACGTCAC GGTGATCGAC
ATTCGCCACC CCGATGAACG CGATGCCGCA CCGCTGAAGC TGCCGCATGG CGAGCCGCTG
GCGATTCCGT TTTACGAACT CGCCGCACGC GCCGCGCAGC TTTCAGGCCG GCAACGCTAC
GCGCTCTATT GCGCCCAGGG CGTGATGAGC AAGATGCAGG CGCTTCGGCT CGCCGACCAG
GGCCTCGACA ACTTCCTCGT CTATCGGGAC GACGCTCACG CCGAGCGCTG A
 
Protein sequence
MRYRVKLFPE ITIKSRPVRK EMVRCLRTNL RNSLAPLVPD VRIADCWDAL EVRIRRPIDE 
TTRERVEGVL SRTSGIHETL VIEAYTFTSF DDTAAHLAAL WQEALAGQRF RVSVKRRGQH
DFTSAELERY LGGALLAAAP QAKVDLTQPD VDVTLELQDQ HLRLVTRRLP GLGGYPLGTQ
GQALALISGG YDSPVAAWRM IRRGLKTHYL FFNLGGAAHE RTVREVVHRV WQGYSASHRV
HFISVPFEEV LDEIQRRVPA GLAGVVLKRM MLRAANRVAA RARIPALITG DALAQVSSQS
VTNLGLIDRV SERPVLRPLI ADDKQRIIED AQRIGTAEHA ERLPEYCGTI SRRPNTRPKL
AQIEAAEADF DMSVLDAALE AASRTRVDRL LDAPPSRPLD VPVIASADAL RAAEDVTVID
IRHPDERDAA PLKLPHGEPL AIPFYELAAR AAQLSGRQRY ALYCAQGVMS KMQALRLADQ
GLDNFLVYRD DAHAER