Gene Saro_2353 details

Gene Information

Locus tag: Saro_2353
Symbol:
ID: 3915698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2500513
End bp: 2501571
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 68%
IMG OID: 640445109
Product: hypothetical protein
Protein accession: YP_497624
Protein GI: 87200367
COG category:
COG ID:
TIGRFAM ID:
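
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2500513
END_BP = 2501571


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1059 352
```

Both results agree with the Gene Length and Protein Length fields of the record.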


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGGCA AGTCAATTTT CAGGACCAGT GCGCTCGCCG CACTTGCCGT CGCGATGTCG 
GCGGCGGCCA TTCCAACATC TGCAATGGCA GAGCCGCAGC GCAGCGAAGA GCGCGGCGGT
CGCTGGGGCG GCGGTGACCG TGGTGGCGAT CGTGGGGGCG ACCGCGGCGG TGAATTCCGT
GGCCGCGCGC AGGGCCAGGC CCAGACGCAG GCCCAGCCGC AGCAGCGCTC AGGCTGGGGC
GGCGGTCAGC AACAGGCGCG TCCGGAACGC AGCGCACCTG CATGGCAGGG CCGGGGCAAC
GCGGACAATG CGCCGCGCTG GGGTTCGCAG GATCGCAGCG GCGGCCAGCG CCCGGGCCGT
GACTGGCAAT CGGGCACGGT GACGCGCCCT GCACCCTCAG CGCGTGCAGC AACTCCCGCC
ACTCCGCAGC GCGGCTGGGA CGGCACCCGC TGGAACCCGA CCAATCCGGA TCGCAACACG
GGCCGCGACT GGAATCGCAA CCGCGACCGC AATGACGGCC GCGAATGGTC CAACCGCGAC
CGCGACAATC GCGATGGCCG CGGCACCACC TGGGGCGGCC GCAACGATGG CCGGCGCGAT
TATCGCAACG GCGACAGCTG GCGCAGCGGA GATAGCTGGC GCAGCGGAGA TAGCTGGCGT
AGCGGAGATA GCTGGCGGCG CGACAACGAT CGCCGGGATG GGCGGGATCG CCGGGATGGG
TGGCGCGGCG ACCGACGCGA TGACCACCGC CGGTGGAGCA ACGACTGGCG CCGCGACAAC
CGCTACAACT GGTACGGCTA TCGCGACAGC CACCGCCACG TCTACCGGAT GCCGCGCTAT
TATGCGCCGT ACCGGGGCTA CAACTACAGC CGCCTGTCGA TCGGGATATT CCTGAATTCG
GGCTTCTATG GCAGCAGCTA CTGGATCAAC GATCCATGGT CTTATCGCCT GCCCCCAGCC
TACGGTCCGT ATCGCTGGGT GCGCTACTAC GATGACGTGC TGCTGGTCGA CACCTACTCC
GGCGAAGTGG TGGACGTGAT CTACGACTTC TTCTGGTAA
 
Protein sequence
MSGKSIFRTS ALAALAVAMS AAAIPTSAMA EPQRSEERGG RWGGGDRGGD RGGDRGGEFR 
GRAQGQAQTQ AQPQQRSGWG GGQQQARPER SAPAWQGRGN ADNAPRWGSQ DRSGGQRPGR
DWQSGTVTRP APSARAATPA TPQRGWDGTR WNPTNPDRNT GRDWNRNRDR NDGREWSNRD
RDNRDGRGTT WGGRNDGRRD YRNGDSWRSG DSWRSGDSWR SGDSWRRDND RRDGRDRRDG
WRGDRRDDHR RWSNDWRRDN RYNWYGYRDS HRHVYRMPRY YAPYRGYNYS RLSIGIFLNS
GFYGSSYWIN DPWSYRLPPA YGPYRWVRYY DDVLLVDTYS GEVVDVIYDF FW
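
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched with the standard library alone. The sketch below makes some assumptions: the helper names are hypothetical, the sequence is read in frame from its first base, and the codon-to-residue mapping is the standard one. Translation table 11 uses that same mapping and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
# Standard codon table in TCAG order; translation table 11 shares this mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTCCGGCA AGTCAATTTT CAGGACCAGT"
print(translate(first_blocks))  # MSGKSIFRTS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives the value to compare against the GC content field.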