Gene Saro_3489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3489 
Symbol 
ID5077638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp91970 
End bp93592 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content66% 
IMG OID640481213 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001165875 
Protein GI146275715 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAGA ACACTTCGAA GCCCTGGGAC TGGCTGCCGA TTCCCGCCCC TCACCAGCAG 
GCCTTCGCGC AACGCGGGGC CTGGAACCTG CGGACGCTGG CAGACCTCGC GCGCGAACGG
GCGGCGTCAG ATCCCGATTT CGTCTGCTTC GTCGATGGCG AAGGCCAATA TACCTTCGCG
CAGGTCCTGG CAGAAGCGGA AGCGCTTTCC GCGTCGTTGC ACGCGCGTGG GTTTCGCGCC
GGCGATGTCA TCGCATTCCA GGTGCCGAAC TGGCGCGAGG CCGCCGTCAT CAACCTGTCG
GCGGCGATGT CTGGCTTCGT GGTCAATCCG ATCGTGCCGA TCTATCGCGA TGCCGAAGTC
ACGATGATGC TTGGCGATTG CCGGGCCGCC GCGATCTTCG TGCCGCAGGT GTTCCGCAAG
GTCGACTACG CCGAAATGGC GCGTCGCTGC CAGAAGGCGC TGCCCGATCT TGCGCACGTC
TTCACCGTGC GGGGCGAGGG GCCGGACGAT TTCGCCACTC TCGTCGCACA GGGGCGCGCT
CTTTCCTTCG AAGTGCCAAC GGTCGATCCG ATGGGCGTCA AGATGGTGCT CTATACCTCG
GGCACGACCG GTCGGCCCAA GGGCGTCCTG CACAGCCATT GCACGTTGCA GCGCATCGTC
GCGGAAAGCG GGCGGCACTG GGGCCTCGGG GCAGGGGAGG CGACGCTGAT GCCTTCGCCG
GTCACGCACG TCTCGGGATA TGCCAATGGC CTCGAAGCGC CGTTCATCTG CGGCATCCGC
TCGGTTCTCA TGGAAGCGTG GAACGCCCAG GATGCGCTGG CCCTGATCGA GAAGCACGAC
CTTGTCGGCA CGGTTGCTGC AACGCCCTTC CTGGTCGAAC TTGCGGCAGC GGCGCGAGCG
GCGGGCACCG GCCTGCCAAG CTTCCGCTTC TTCGCCTGCG GCGGTGCGGC GGTGCCGGCG
GACCTTATCC CGGCCGCCAA CGCCGCCTTC GAGAACTGCC GGGCCTTTCG CGTCTTCGGC
GCGTCCGAAG TTCCGCTCGT TACCTTCGGC TGGCCGCACG ACGAGCGCCT TGCCGCGACC
ACCGATGGCG AGGTGGTGGA CTACGAAGTC CGCATCGTCG ACCACGAGGA CAATGATCTT
CCGCGCGGTG TCGAAGGCGA AATCCTTGCG CGCGGTCCCG GCATGATGAT GGGCTATGCC
GACGCCGCGC AGACCGCAGA GGCGATCACG CCCGACGGCT TCTTCCGCAC CGGCGACCTG
GGCGTGCTGT CCGAAGAGGG TGCGGTAACG ATCACCGGGC GCAAGAAGGA CCTCATCATC
CGCGGCGGAG AGAACATCTC GGCCAAGGAA ATCGAGGACG TGCTGCACAG CCATGACGCG
GTGAAGGAAG CCTCGGTCGT CGCCATGCCG CACGAACGCC TTGGCGAGGG CATCTGCGCC
TATGTGATCC TGTCCGCCGC AGTCGACGCG GCGGTGCTTG CCGCGCATGT TGCCGCTTCG
GGCATGGCGA AGCAGAAGAT CCCCGAACGC TTCGAATTCG TAGAGGACTT TCCCCGCACC
GCTAGCGGCA AGGTCCGCAA GGACCAGCTG CGGGCGATGA TCCGGGAGAA AGTGGGGGGC
TGA
 
Protein sequence
MNENTSKPWD WLPIPAPHQQ AFAQRGAWNL RTLADLARER AASDPDFVCF VDGEGQYTFA 
QVLAEAEALS ASLHARGFRA GDVIAFQVPN WREAAVINLS AAMSGFVVNP IVPIYRDAEV
TMMLGDCRAA AIFVPQVFRK VDYAEMARRC QKALPDLAHV FTVRGEGPDD FATLVAQGRA
LSFEVPTVDP MGVKMVLYTS GTTGRPKGVL HSHCTLQRIV AESGRHWGLG AGEATLMPSP
VTHVSGYANG LEAPFICGIR SVLMEAWNAQ DALALIEKHD LVGTVAATPF LVELAAAARA
AGTGLPSFRF FACGGAAVPA DLIPAANAAF ENCRAFRVFG ASEVPLVTFG WPHDERLAAT
TDGEVVDYEV RIVDHEDNDL PRGVEGEILA RGPGMMMGYA DAAQTAEAIT PDGFFRTGDL
GVLSEEGAVT ITGRKKDLII RGGENISAKE IEDVLHSHDA VKEASVVAMP HERLGEGICA
YVILSAAVDA AVLAAHVAAS GMAKQKIPER FEFVEDFPRT ASGKVRKDQL RAMIREKVGG