Gene Saro_0167 details


Gene Information

Locus tag: Saro_0167
Symbol:
ID: 3918303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 165119
End bp: 166585
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 64%
IMG OID: 640442893
Product: sulfotransferase
Protein accession: YP_495450
Protein GI: 87198193
COG category:
COG ID:
TIGRFAM ID:
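
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 165119
end_bp = 166585

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 1467 0 488
```

Under these assumptions the computed values match the listed Gene Length (1467 bp) and Protein Length (488 aa).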


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAG ACGAAGCCAT TACCCAGCTA CGCCGCTTCG TTGCCGCGCT CGAAAGCGGT 
AGCCGGCGCA ATGCCAACGA CGCTGCGTTC GCGCTCCTTG CCGGCAATCC ACCGCTAGGT
GACCGCTGGC AGTCCATTGC CCACGTAATG CAGACGAACG GCGAGTATTC GGCCGCACAT
CTCGCCATGG CACGTTTTGC CGCTCACAGG GCCGACCCAA ATGCCCGTTT CGCGCAGGCC
GCCATGCTGG CGCAGACCGG ACAGTTGGAG GAGGCCTGGA AGGTGATGGG AAGCGTGCCC
TTCGGGGTCC CTTCCGTCTC TGGCCACCAC TACATTCGCG GCACCATCGC GGTTAACCTC
GGCCATATCG AAGATGCCGA GAAGCATCTT CTCGCCGCAA TCGAGGCAGA GCCGGCGCTC
GGGCAGGCCA TGCTGTCGCT CGCCGCGGCA CGCAAGCGCA AGGCGGGCGA CCCCATCGGC
GACCGTATTC TATCGGCGGA ACCCCGAATG GAGGGTGCAC CGCCGCTGGA GCGGGCGCAC
TTCCACTATG CGGCCGGTCG CGTCCATTTC GACCGCCGCG AACCGGACGA GGCCTTCCGC
CATTTTTCCG CCGGTGCAGA ACTCGTTCGC GGCTGGCGCT CCTACGATGC CGCTGCCGAT
GCGCGCGACG CCGCTGAATG CCGGCAGGGC TTCGACCGCG ATTTCGTCGA CAAGATCAAC
AATCTTGTAA CCCTGGACAC AAGCCAGCCC ATCTTCGTCA CCGGACTGCC CCGTTCGGGC
ACCACACTGG TCGAACAGAT CCTCGTCAGT CACTCTGCGG TGTCGGGCGG CGAGGAACTG
GGCCGCATGG GCGTCGTCCG TCGCGATCTC ACGTCGCTCT CAGCCAAGGG ACTGTCGGAC
TATCTCCAGG CCGGCGGCAA TGCTGACGAC CTTGCCGCGC TATACCTACA CCTCGGGCAA
GAGCGCTTCG GAAAAGAGGG GCGCTTCGTC GACAAGGCGC TCAACACCAG TCGTTTCACC
GGGCTCATAG CGGCGCTTCT TCCGCATGCG CCCATCGTCT GGCTGCGTCG CGATCCGGCC
GACTGCGCGT TCTCGGCTTA CCGCACGTAC TTCATCAACG GGCTCGACTG GTCCTGGAAC
CTGGAGGACA TCGCGACGCA CTTCGCGCTT GAGGACCAGC TTTTCCACCA CTGGTCGCGC
ATGTTCCCCG ACCGCATCCT CGCGGTCGAT TACCAGCAGT TGGTTCAGGA ACCCCAAGCC
CAGATTCGTC GCATCCTGGC GCACTGCAAT CTCGAGGAGG AACCGCAGGT CTTCCGTCCG
CACGAGACGC AGCGCGTGGT CTCCACGGCA AGCGTGATGC AGGTGCGCGA GCCGATCAAT
ACCGGGGCGG TTGGCGCCGC AGGTGCCTAC CGCCTGCATC TCGCCCCGTT CGTCGATCGT
TACGAGGCAC TGTCCGCCGC AACCTGA
 
Protein sequence
MTEDEAITQL RRFVAALESG SRRNANDAAF ALLAGNPPLG DRWQSIAHVM QTNGEYSAAH 
LAMARFAAHR ADPNARFAQA AMLAQTGQLE EAWKVMGSVP FGVPSVSGHH YIRGTIAVNL
GHIEDAEKHL LAAIEAEPAL GQAMLSLAAA RKRKAGDPIG DRILSAEPRM EGAPPLERAH
FHYAAGRVHF DRREPDEAFR HFSAGAELVR GWRSYDAAAD ARDAAECRQG FDRDFVDKIN
NLVTLDTSQP IFVTGLPRSG TTLVEQILVS HSAVSGGEEL GRMGVVRRDL TSLSAKGLSD
YLQAGGNADD LAALYLHLGQ ERFGKEGRFV DKALNTSRFT GLIAALLPHA PIVWLRRDPA
DCAFSAYRTY FINGLDWSWN LEDIATHFAL EDQLFHHWSR MFPDRILAVD YQQLVQEPQA
QIRRILAHCN LEEEPQVFRP HETQRVVSTA SVMQVREPIN TGAVGAAGAY RLHLAPFVDR
YEALSAAT
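
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which this sketch does not model). The `translate` function and the table construction are illustrative helpers, not part of the record.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence listed above.
first_bases = "ATGACCGAAG ACGAAGCCAT TACCCAGCTA"
last_bases = "TACGAGGCAC TGTCCGCCGC AACCTGA"

print(translate(first_bases))  # prints: MTEDEAITQL
print(translate(last_bases))   # prints: YEALSAAT
```

Both outputs agree with the start and end of the protein sequence above. The last stretch stays in frame because every preceding sequence line holds 60 bases, a multiple of three.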