Gene Saro_0199 details

Gene Information

Locus tag: Saro_0199
Symbol:
ID: 3916187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 205411
End bp: 206622
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 68%
IMG OID: 640442925
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_495482
Protein GI: 87198225
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
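
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 205411
end_bp = 206622
gene_length_bp = 1212
protein_length_aa = 403


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS, minus one for the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length.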


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGTGG ACACGATCAC GCGCGGCGCG CAATGGCCGG GCCTGCTGAA TCCCGACGGC 
AGCCGCTGGC ACTATCTCGA CACCGCAGCC ACCGCGCAGA AGCCGCAGGT GGTGATCGAC
GCGGTAACTC GCGCCCTCGG CGCCGACTAC GCCACCGTCC ATCGCGGCGT CTATGCCCGC
TCGGCCGACA TGACGCTCGC CTTCGAGGCC GCGCGCCGCA AGGTGGCGGG GCTGGTCAAC
GGCGATGAAG GCGAGATCGT CTTCACGCGC GGCGCAACCG AGGCGATCAA CCTCGTCGCG
CAGACCTGGG GCCAGGCAAA CCTCAAGGCG GGCGACCGCA TCCTGCTCTC CACGCTCGAG
CATCATTCGA ACATCGTACC TTGGCAGTTG CTGCGCGACC GGACCGGAGT CGAGATCGAC
GTCTGCCCGT TGACCGAGGA CGGCCGCATC GACCTTGCCG CGGCGGAGCG CATCCTGACC
CCCGCGCACA AGCTTGTCGC TCTTGCCCAT GTGTCGAACG TGCTCGGTTC TGTGCTCGAC
GTGGCGCAGG CGGTCCGGCT GGCGCGTGCG GTCGGGGCGA AGATCCTGCT CGACGGCTGC
CAGGCGGTGC CGCGCCTCGC TGTCGACGTG AAGGCGATGG ACGCTGATTT CTACGTCTTT
TCCGCCCACA AGCTCTATGG CCCGACCGGC ATAGGCGCGC TTTGGGCCAA GGCCGCGATT
CTCGACGCCA TGCCGCCGTG GCAGGGCGGC GGGGCGATGA TCGACCGCGT CACTTTCGAG
CGCACGACCT ATGCCCCCGC GCCGCAGCGT TTCGAAGCCG GCACCCCGGC CATCGTCGAG
GCCATCGGCT TCGGCGCGGC GGTGGACTTC GTGCAGGCAC AGGGCCTCGA TGCGATCCAC
GCCCATGAAG TCGCGCTCGT GGCCAAGGCC CGCGAGGCGC TCGGGCGGAT GAACTCCGTC
CGCCTGTTCG GGCCCGAGGA CAGCGCCGGC ATCGTCAGCT TCGCCATCGA GGGCGTGCAC
CCGCACGATC TCGGCACGAT CCTCGACGAG GAAGGCGTGG CGATCCGTGC CGGGCACCAC
TGCGCGCAGC CATTGATGGA CCACCTTGGC GTTCCCGCCA CGGCCCGGGC CAGCTTCGGC
ATCTACAGCG ATGAAAGCGA TATCGCCGCC CTCGTGCGCG GCATCGAAAG GACCAAGAGG
ATATTCGGAT GA
 
Protein sequence
MSVDTITRGA QWPGLLNPDG SRWHYLDTAA TAQKPQVVID AVTRALGADY ATVHRGVYAR 
SADMTLAFEA ARRKVAGLVN GDEGEIVFTR GATEAINLVA QTWGQANLKA GDRILLSTLE
HHSNIVPWQL LRDRTGVEID VCPLTEDGRI DLAAAERILT PAHKLVALAH VSNVLGSVLD
VAQAVRLARA VGAKILLDGC QAVPRLAVDV KAMDADFYVF SAHKLYGPTG IGALWAKAAI
LDAMPPWQGG GAMIDRVTFE RTTYAPAPQR FEAGTPAIVE AIGFGAAVDF VQAQGLDAIH
AHEVALVAKA REALGRMNSV RLFGPEDSAG IVSFAIEGVH PHDLGTILDE EGVAIRAGHH
CAQPLMDHLG VPATARASFG IYSDESDIAA LVRGIERTKR IFG
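
The relationship between the gene and protein sequences above can be illustrated with a small translation routine. This is a rough sketch, not the method used to generate this record: it builds the standard codon assignments (which translation table 11 shares), and it treats only ATG, GTG and TTG as initiator codons, which is a simplifying assumption since table 11 lists further alternative starts. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of initiator codons, all read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate complete codons until the first stop codon.

    A trailing partial codon is ignored. Bases other than A, C, G, T
    raise KeyError in this sketch.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence: six full codons plus two spare bases.
print(translate("GTGAGCGTGG ACACGATCAC"))
# Last 12 bases of the gene sequence, an in-frame fragment ending in TGA.
print(translate("ATATTCGGAT GA", first_is_start=False))
```

The first call shows why the protein begins with M even though the gene starts with GTG, and the second shows the terminal TGA stop codon being dropped from the protein.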