Gene Swit_4701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_4701 
Symbol 
ID5199095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp5171224 
End bp5172348 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content69% 
IMG OID640584255 
Productbifunctional sulfur carrier protein/thiazole synthase protein 
Protein accessionYP_001265176 
Protein GI148557594 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2022] Uncharacterized enzyme of thiazole biosynthesis
[COG2104] Sulfur transfer protein involved in thiamine biosynthesis 
TIGRFAM ID[TIGR01683] thiamine biosynthesis protein ThiS 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.468322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.837099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGC GATCCGCCAC GCGGCAAGCC TTTCCGTTGC GATGGTGCGA CCCGCACCCC 
TATATGGCCC CCCGACGCAA AGCGAAAGCG CATACGGCGC CACCCATTGG AGATCCCATG
ACCAATCCCG ACGGCACGAT CCAGGTCCGC ATCAACGGCG AGCACCGCCG GGTCAGGGCA
GGGCTGACGA TCGCCGACCT CGCGCTGGAA CTGGGGCTGG AGCCGACCAA GGTCGCGGTC
GAGCGCAATC TGGAGGTTGT GCCCCGCTCG ACCTTGGGGC AGGTGCTGCT CGACGATGGC
GACGAGCTGG AAATCGTCCA TTTTGTTGGT GGTGGAGATC ATGGCGTGAC GCTTGACGAC
GACAGCTGGA CGGTGGCGGG GCGGACCTTC CGGTCGCGGC TGATCGTCGG CACCGGCAAG
TACAAGGACT TCGCCCAGAA CGCCGCCGCG GTCGAGGCGT CGGGGGCGGA GATCGTCACC
GTCGCGGTCC GTCGCGTCAA CGTCGCCGAT CCCAAGGCGC CGATGCTGAC CGACTATATC
GACCCGAAGA AGATCACCTA TTTGCCCAAC ACCGCCGGCT GCTACACCGG CGAGGAGGCG
ATCCGCACGC TGCGCCTGGC GCGCGAGGCG GGCGGCTGGG ACCTCGTCAA GCTCGAGGTG
CTGGGCGAGG CGAAGACGCT CTATCCCGAC ATGGTCGAGA CGCTGCGGGC GACCGAGGTG
CTGGCCAAGG AAGGCTTCAA GCCGATGGTC TATTGCGTCG ACGATCCGAT CGCCGCCAAG
CGGCTGGAGG ATGCCGGCGC GGTCGCGATC ATGCCGCTCG GCGCGCCGAT CGGCTCGGGC
CTCGGCATCC AGAACCGGGT GACGATCCGC CTGATCGTCG AGGGCACCAG CCTGCCGGTG
CTGGTCGACG CCGGGGTCGG CACCGCGTCG GAGGCGTCGT CGGCGATGGA GCTGGGCTGC
GCGGGCGTGC TGATGAACAC CGCGATCGCC GAGGCGAAGA ACCCGGTGAT GATGGCGCGC
GCGATGAAGC TGGCGGTCGA GAGCGGCCGC CTCGCCTATC GCGCCGGCCG CATGGGCCGC
CGCATGTACG CCGATCCGTC GAGCCCGCTG GCCGGGCTGA TCTGA
 
Protein sequence
MTGRSATRQA FPLRWCDPHP YMAPRRKAKA HTAPPIGDPM TNPDGTIQVR INGEHRRVRA 
GLTIADLALE LGLEPTKVAV ERNLEVVPRS TLGQVLLDDG DELEIVHFVG GGDHGVTLDD
DSWTVAGRTF RSRLIVGTGK YKDFAQNAAA VEASGAEIVT VAVRRVNVAD PKAPMLTDYI
DPKKITYLPN TAGCYTGEEA IRTLRLAREA GGWDLVKLEV LGEAKTLYPD MVETLRATEV
LAKEGFKPMV YCVDDPIAAK RLEDAGAVAI MPLGAPIGSG LGIQNRVTIR LIVEGTSLPV
LVDAGVGTAS EASSAMELGC AGVLMNTAIA EAKNPVMMAR AMKLAVESGR LAYRAGRMGR
RMYADPSSPL AGLI