Gene Saro_1693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1693 
Symbol 
ID3916268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1776860 
End bp1778491 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content65% 
IMG OID640444434 
ProductAMP-binding domain protein 
Protein accessionYP_496967 
Protein GI87199710 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.301435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAGT TGTCCTACGC GAAGGGGCCC GCCGACGAGC CCCTTCTTGA AAAGACCATC 
GGCCAGGCAC TGCGCGATGC GGCGGCGCTC TGGGGCGACG AGCTTGCCCT GGTATCGCGC
CACCAGCAGA TCCGCTGGAC CTGGGCCCAG CTCGACGCAG AGGTCGATCG TATCGCCACG
GGCCTGCTCG ACAGGGGCGT GGCCAAGGGC GACCGTGTCG GCATATGGGC GCCGAACTGC
GCGGAATGGA CCGTCCTCCA GTTCGCGACC GCGCGGATCG GTGCGATCCT CGTTACCATC
AATCCCGCCT ATCGCACCAG CGAGGTCGAG TACGCGCTGA ACAAGGTCGG CTGCACCTTC
CTTGTCACCG CCGCGCGCTT CAAGACCAGC GACTATGTGG CGATGCTGCG CGAACTGGGG
CCGGACAAGC TGCCGGGCGT AAGCTGCATG GTCGTGCTCG GCGCGGATCG CCACGACGGC
TTCGAGCCAT GGGAGGCCCT GCGTGCCGAA CCGGACGCCG CACGCCTTGC CGCCGCCGAG
GCTGCGCTGA ACCAGAACGA TGCGATCAAC ATCCAGTTCA CCAGCGGCAC CACCGGCTTT
CCCAAGGGCG CCACGCTCAC CCATCGCAAC ATCCTCAACA ACGGCCATTT CACCGCCCGC
ACGATCAAGC TGACCCAGCG CGACCGCATC TGCATCCCGG TGCCGCTCTA CCACTGCTTC
GGCATGGTCC TGGGCAATCT TGCCGCGCTC GCCAGCGGGG CGGCGATGGT CTACCCCGGC
GAGGCCTACG ATCCGCAGCT TGCGCTCGCG GCGGTGGCCG AGGAGGGCTG CACCGCGCTC
TATGGCGTGC CGACGATGTT CATCACCATC CTCGCGCAGC CGGACCTTGA CCGCTACGAC
GTATCGACCT TGCGCACCGG CATCATGGCC GGCTCGCCTT GCCCGGTCAG CACGATGCGC
CAGGTCATGG ACCGCCTCAA CATGACCGAG GTGACCATCG GCTATGGCAT GACCGAGACC
AGTCCGCTCA CCACCCAGAC AGCGACCGAC GATCCGCTGG AAGAGCGCGT CGGCACTGTC
GGCCGTGTCC ATCCCCATGC CGAGGCGAAG ATCGTCGGGC TCGATGGCGA AACGCTGCCC
ATCGGCCAGC AGGGCGAATA CTGCTCGCGC GGATATGCCG TCATGCTGGG TTACTGGGAC
GATCCGGAAA AGACAGCAGA AGCCATCGAC GGCGAGGGCT GGATGCATTC CGGCGATCTC
GCGACGATGG ACGAACATGG CTATGTCCGT ATCACCGGCC GCATCAAGGA CATGATCATC
CGCGGCGGTG AGAATATCTA CCCGCGTGAA ATTGAGGAGT TTCTCCTCAC CCATCCCGCC
GTTCAGGATG CGCAGGTCTT CGGCGTTTCG GACGAGAAGT TCGGCGAGGA AGTCTGCGCC
TGGGTCATCG CGCGGTCCGG CCACGCGCTC TCGCACGACG ATATCCTCGC CCACTGCAAG
GGCCGCATCG CACACTACAA GGTGCCGCGC CATGTCCGCG TGGTCGAAGC CTTCGCCATG
ACAGTCACCG GCAAGGCGCA GAAGTTCGAG ATGCGCAAGA TGATGGAAGC CGAACTCACG
CGCACGGGCT GA
 
Protein sequence
MTQLSYAKGP ADEPLLEKTI GQALRDAAAL WGDELALVSR HQQIRWTWAQ LDAEVDRIAT 
GLLDRGVAKG DRVGIWAPNC AEWTVLQFAT ARIGAILVTI NPAYRTSEVE YALNKVGCTF
LVTAARFKTS DYVAMLRELG PDKLPGVSCM VVLGADRHDG FEPWEALRAE PDAARLAAAE
AALNQNDAIN IQFTSGTTGF PKGATLTHRN ILNNGHFTAR TIKLTQRDRI CIPVPLYHCF
GMVLGNLAAL ASGAAMVYPG EAYDPQLALA AVAEEGCTAL YGVPTMFITI LAQPDLDRYD
VSTLRTGIMA GSPCPVSTMR QVMDRLNMTE VTIGYGMTET SPLTTQTATD DPLEERVGTV
GRVHPHAEAK IVGLDGETLP IGQQGEYCSR GYAVMLGYWD DPEKTAEAID GEGWMHSGDL
ATMDEHGYVR ITGRIKDMII RGGENIYPRE IEEFLLTHPA VQDAQVFGVS DEKFGEEVCA
WVIARSGHAL SHDDILAHCK GRIAHYKVPR HVRVVEAFAM TVTGKAQKFE MRKMMEAELT
RTG