Gene Saro_3231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3231 
Symbol 
ID3917489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3451482 
End bp3452753 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content64% 
IMG OID640446015 
Producthypothetical protein 
Protein accessionYP_498500 
Protein GI87201243 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGGCG CGCTACGGGC ATCGATCAAG CGGGTCATCC CGGCGCGACT CAAGCGCCTG 
ATAAAGGACG TGCGCGATCT GGCGACGCCC CCGCCCATCG ATGACGTCGT CCTGGCGGAA
TATGTCCTAG TGCCCGATGC CTCGGAGGCG CCGCGGCTGA ACTTCGTGAT CTCGAACCTG
ACACGGGCCA CGGCCTTCGG CGGCGTCACC ACCGGGATAG ACGTCTTTCT CGAACTTGCG
CGGCATCTGT CGCGCAAGAC GCCCCTCGAC CTGCGCGTCA TCATCGCCGA GCCCGATCGC
GAGACCGATC TGTCGATCAT AGCCGAGCGC GCGGGCAGGT TCGGACTCGA GGGTGAGCGG
ATCGCGTTCC ACGCGGTGCG CTCGTCCACC GAGGCCATAC CGGTGCGCCG CCGCGATGTC
TTCGTGTCCT ACAGCACCTG GATGACGCTT AACCTGCAGG GCCTGCTTGC CCGGCAGGCC
GCGCATTTCG CCCAGGCGAT GAAGCCGCTG GTCTTCCTGA TCCAGGAGTA TGAGCCCCAC
TTCTACCCCT TTTCGTCGGC GCATATGCTC TCGCGCGAGG CATACGACCG GACCGAACGG
CTGTGGGGGG TGTTCAACAG CTCAAACCTG CAATGGTATT TCGAGCAGGT GGGCCATTCG
GCGGAGCGGT CGTTCCTGCT CGAGCCGGTC ATAAACGACA AGCTCCGGCC CTATCTCGAT
AGCGTCGCGA CAAGCGAGCG CCGCAAGCGC ATACTTGTCT ACGGGCGCCC CGGAATTGCG
CGCAACTGCT TTCCCGCCAT CGTCCGCGGC CTTCGCCGCT GGGTGCGCGA TTTTCCGGAG
GCGGCCGAAT GGGAGGTCGT CTCGGCCGGC ACCGCGCATA AGCCGATTGC GTTGGGGCAG
GGGCGGATGC TGGAATCGGT GGGCAAGCTG TCGCTTGAAG AATACGCGGA AATGCTCGTT
TCCAGTTCGG TGGGGCTCAG CCTGATGGCG TCTCCGCACC CAAGCTATCC CCCGCTGGAG
ATGGCCCACA TGGGGCTGCG GACCATCACC AACGGGTATT TCGGAAAGGA TCTCTCGACG
TTCCATCCGA ACATCCGAAG CGTCGGCAGC ATCACCGAAA AGGCGCTGGC CGACGCGCTG
TCCCGGGCTT GTGCCGAACA TGGCGCGCCG GTGAACGCGT ATCGCAACGA AAGCTACGTG
CGCTCCGATC CTTACCCCTT CGTGCCAGCG CTGTGCGCAG CCATCGCAGA GGAAATCGGC
ATCGCCCGCT AG
 
Protein sequence
MSGALRASIK RVIPARLKRL IKDVRDLATP PPIDDVVLAE YVLVPDASEA PRLNFVISNL 
TRATAFGGVT TGIDVFLELA RHLSRKTPLD LRVIIAEPDR ETDLSIIAER AGRFGLEGER
IAFHAVRSST EAIPVRRRDV FVSYSTWMTL NLQGLLARQA AHFAQAMKPL VFLIQEYEPH
FYPFSSAHML SREAYDRTER LWGVFNSSNL QWYFEQVGHS AERSFLLEPV INDKLRPYLD
SVATSERRKR ILVYGRPGIA RNCFPAIVRG LRRWVRDFPE AAEWEVVSAG TAHKPIALGQ
GRMLESVGKL SLEEYAEMLV SSSVGLSLMA SPHPSYPPLE MAHMGLRTIT NGYFGKDLST
FHPNIRSVGS ITEKALADAL SRACAEHGAP VNAYRNESYV RSDPYPFVPA LCAAIAEEIG
IAR