Gene Saro_3290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3290 
Symbol 
ID3915937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3507383 
End bp3508378 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content66% 
IMG OID640446075 
Productbifunctional sulfur carrier protein/thiazole synthase protein 
Protein accessionYP_498559 
Protein GI87201302 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2022] Uncharacterized enzyme of thiazole biosynthesis
[COG2104] Sulfur transfer protein involved in thiamine biosynthesis 
TIGRFAM ID[TIGR01683] thiamine biosynthesis protein ThiS 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGAC AGCTATCCCT TACCGTCAAC GGCGAACCCC GCCGCGCCGC GCCCGGATCG 
ATCGCCGACC TGGTGCGCAG CCTGGAACTC GATCCGGCCA AGGTCGCGGT CGAACGCAAT
GGCGAGATCG TCCCGCGCTC GACCTTGGCC AGCGTGGCGA TCGCCGATGG GGACGTGCTG
GAAATCGTGC ATTTCGTGGG TGGAGGACAA TCGGACGTGA CCGACAACAA CGATACCTGG
ACCGTCGCCG GACGCACCTT CACCTCGCGC CTGATCGTGG GCACGGGCAA GTACAAGGAC
TTCGAGCAGA ACGCCGCCGC GGTCGAAGCA TCGGGCGCGG AGATCGTCAC CGTCGCCGTG
CGCAGGGTCA ACGTCTCGGA CCCCAAGGCG CCGATGCTGA CCGACTACAT CGACCCGAAG
AAAATCACCT ACCTGCCCAA CACCGCCGGC TGCTTTACCG CCGAGGACGC GATCCGCACG
CTGCGCCTTG CGCGCGAGGC GGGCGGCTGG GATCTGGTGA AGCTGGAAGT CCTGGGCGAG
GCGCGCACGC TCTATCCCAA CATGATCGAA ACGATCCGCG CGACCGAAGT CCTGGCCAAG
GAAGGCTTCC TGCCAATGGT CTATTGCGTC GACGATCCGA TCGCTGCCAA GCAGCTTGAA
GACGCGGGCG CGGTCGCCGT CATGCCACTG GGCGCGCCGA TCGGTTCGGG CCTCGGCATC
CAGAACAAGG TAACGGTGCG GCTGATCGTC GAAGGCGCCA AGGTGCCGGT GCTCGTCGAC
GCAGGCGTGG GCACCGCTTC CGAAGCCGCC GTGGCGATGG AGCTTGGCTG CGATGGCGTG
CTGATGAACA CCGCCATCGC CGAGGCCAAG GACCCGATCC GCATGGCCCG CGCAATGAAG
CTGGCCGTTC AGGCCGGACG CGACGCCTAT CTCGCCGGCC GCATGCCGAC GCGCAAGTAC
GCCGATCCGT CGAGCCCGCT GGCCGGGTTG ATCTGA
 
Protein sequence
MTGQLSLTVN GEPRRAAPGS IADLVRSLEL DPAKVAVERN GEIVPRSTLA SVAIADGDVL 
EIVHFVGGGQ SDVTDNNDTW TVAGRTFTSR LIVGTGKYKD FEQNAAAVEA SGAEIVTVAV
RRVNVSDPKA PMLTDYIDPK KITYLPNTAG CFTAEDAIRT LRLAREAGGW DLVKLEVLGE
ARTLYPNMIE TIRATEVLAK EGFLPMVYCV DDPIAAKQLE DAGAVAVMPL GAPIGSGLGI
QNKVTVRLIV EGAKVPVLVD AGVGTASEAA VAMELGCDGV LMNTAIAEAK DPIRMARAMK
LAVQAGRDAY LAGRMPTRKY ADPSSPLAGL I