Gene Saro_2472 details

Gene Information

Locus tag: Saro_2472
Symbol:
ID: 3916791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2671180
End bp: 2672589
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 64%
IMG OID: 640445227
Product: hypothetical protein
Protein accession: YP_497742
Protein GI: 87200485
COG category: [S] Function unknown
COG ID: [COG5361] Uncharacterized conserved protein
TIGRFAM ID:
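
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2671180
end_bp = 2672589
gene_length_bp = 1410
protein_length_aa = 469


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the coordinates give 1410 bp, which corresponds to 470 codons, one of which is the stop codon, leaving 469 residues.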


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGACGA GAAGCCTTGT TTTCGCCGTG GCGGTCGCGA GCCTCGTGCA TGTTGTCTTG 
CCGGCACAGG CAAGGCCCGT CGCACAGGCA GCATCGTTGA CCGATACCCA GATCGAGGAA
CTGGCCTATC GCGGCTATGT CTGGGGAATG CCGCTGGTCG AGGCGGCGCT GATCCGCAAG
CGGTTCACGC TGGGACAGTC GGCGGGCGAT CCCGGAACGC CCATAAACGC GTTCAAGCAC
CGGCGGACGC TTTCAGGCCC GGAAATGCGA GTTGGAGTCG GGCCGAACAA CGACACGATC
TATTCATCGG CATGGCTGGA TCTGGCGACT GGCCCGCTGA TCGTGTCGGC GCCCGACTTC
GGCAGGCGCT ATTACACCTT CTCGATCAAT CTCGCAGACT CCTCGTCGGA ACGATCCCTG
GGGCAACGGA CGCACGGCGG ACAGTTGCCG CCGCTGTTCG TACATGGCCC GGGGTGGCAC
GGGTCCGCTC CACCGGGGAT GGTCGATGTG CCTAGTTCTA CCCGCTATGT GAACGTCGCC
GGTCGCATCC TCGTCCGTTC CCCGTCCGAG TACGACGAGG TCCATGCCCT TCAGGACAAG
CTCGCCGTGA TCCGCTGGGC CGATTGGCGC AAGGGTAAGC ACACTCCCGC ACTGGCAGCA
GATCAGCGGG AACTTGCCCA TGGCCCTCAA GGCGCGCCGC CGGAACTTGT GTTCTTTCAC
CGTCTCGCAT CGGTACTGCA GGATTGGATC GTTCGCCCCG AAGATCGTGC CATGATCGCA
GAACTCTCGC GCCTGGAGAT TACCCAGAAG GACGGCTTCC GGCCCGGCAG GCTTGCTGCG
GGTCAACTCG CCGCACTGGC GCGCGGTTTC GTTCGCGCAC GAGAGGCCGT CCGCCTTGCG
TCGCTTCGCC TCGGAGTGGA GCGCAATGGC TGGACCACAA ATTATCGCGG CCCGCGCTTC
GGATCAGACC TTATGCTGCG GGCTGCGGTA GCCAAGGACC AGATCTTCGT GGCCGTACCG
GAAGAGGCCA TCTATCCGAT TGCGCAAGTG GACGCGGCGG GAATCAGGCT CGACGGCGCG
CACCGTTACC GCATCTGCTT TGGCTCCGGA CAATTGCCGC CGGTGGACGC GTTCTGGTCG
ATCACGGCAT ACGATGACAC GGGCTTCATG ATCCCCAACG CCGTGCATCG CTACTCCGTC
GGGGATCGGA CCGACGGTCT GGTCGCAGAC AAGGATGGCG GCGTCACGAT CGAGGTCGGC
GCGACTGCAC CCGTGGACGG CGCAACCGTC AACTGGCTGC CGGTCGCAGC CGATGCGCCG
TTCTATCTCA TGATGCGCCT TTATCGGCCG CAGGGCAGCG CGCTCGAACA GGTGTGGGTA
CCGCCGGCGA TCAGACGGGT CGATCAATAG
 
Protein sequence
MKTRSLVFAV AVASLVHVVL PAQARPVAQA ASLTDTQIEE LAYRGYVWGM PLVEAALIRK 
RFTLGQSAGD PGTPINAFKH RRTLSGPEMR VGVGPNNDTI YSSAWLDLAT GPLIVSAPDF
GRRYYTFSIN LADSSSERSL GQRTHGGQLP PLFVHGPGWH GSAPPGMVDV PSSTRYVNVA
GRILVRSPSE YDEVHALQDK LAVIRWADWR KGKHTPALAA DQRELAHGPQ GAPPELVFFH
RLASVLQDWI VRPEDRAMIA ELSRLEITQK DGFRPGRLAA GQLAALARGF VRAREAVRLA
SLRLGVERNG WTTNYRGPRF GSDLMLRAAV AKDQIFVAVP EEAIYPIAQV DAAGIRLDGA
HRYRICFGSG QLPPVDAFWS ITAYDDTGFM IPNAVHRYSV GDRTDGLVAD KDGGVTIEVG
ATAPVDGATV NWLPVAADAP FYLMMRLYRP QGSALEQVWV PPAIRRVDQ