Gene Saro_3767 details


Gene Information

Locus tag: Saro_3767
Symbol:
ID: 5077915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 404342
End bp: 405880
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 66%
IMG OID: 640481490
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001166152
Protein GI: 146275992
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
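
The coordinates and lengths in this table are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that contributes no residue. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 404342
END_BP = 405880


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no amino acid.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1539
print(protein_length(length_bp))  # 512
```

Both results agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields.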


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0143855
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCGATC CGCTGCCCGC CACCATACCC CATGCCGCGC AGGCGGCGGC GGCCACCTGG 
CCCGATGCAC CTTCCGTGCT CGAAAACGGC GAAGTCTGGA GCTTTGCCGA GCTGTGGCAG
CGCAGCCGCG CGGCGGCTTC GGCGCTGATC GCGCGGGGCA TAAAGGCCGG AGATCGCGTG
GCGATCTGGG CGCCGAACTC GCGCGAGTGG ATCGTTGCGG CGATCGCGAC CATGGCGTGC
GGGGCGGCGG TGGTGACGCT CAACACGCGG CTCAAGGGCC GCGAGGCGGG CGACATCTTG
CGGCGCACCA ATGCGCGGCT TCTGTTCACG GTCGAGGGCT TTCTCGGCAT CGACTATCGC
GCGCTGATCG TGGACGAGGA TCTGCCGGCG CTTGAAGGAA CAGTCCTGCT TGACCGGGAA
TTCGACGCTT TCATGCGTGA CGGCAGGGGG GCGGGCGATC CGGCGGTCGA TGCGGCGATG
GCGCAGATCG ATGCCGATAC CGTGTCGGAC ATCCTGTTCA CCAGCGGCAC GACAGGCAGC
CCCAAGGGCG TGTTGATGAC TTATGGCCGC GTCCTGCCTC AGGCGGCGGT GTGGTGCGCG
AACACCCGCC TGACCGAGGG CGACCGTTAC CTGATTGCCA ACCCTTTCTT CCATTCCTTC
GGGATGAAGG TGGGCTGGGT CGCGTGCATC CTCGCCGGTG CCGTGGCAGT GCCCATGCTG
CAATTCGACG TGGGTCAGGC GATTGATCTG ATCGAGCGCG AGCGCATCAC GTTCATGCCC
GGTCCGCCGA CGATCTTCCA GATGCTGCTG GCCGAACTCG ACAAGCGCAA GTGGGACTGT
TCCTCGCTGC GGGGAGGAAC GACCGGCGCG GCGACAGTGC CGCCCGCACT GGTGGAGCGC
ATCCGCAACG ACCTGGGCAT GGTGGACCTC ATCACCGCCT ATGGCATGAC CGAATGCGTC
AACATCACGA CGTGCGTGCC CGGTGACGAT GCCGAGACCA TTGCACGGAC CTGTGGCAAG
GCGTTTCCGG GCAACGAGGT GCGCATCGCC GACGAGAACG GGAACGAACT GCCAAGGGGC
GAGGCGGGAG AGGTTCTGGT GCGGGGGCAG GGCGTCATGC TCGGTTATCT CGACAACCCG
GAAGCCACTG CCGAGGCGAT CGACGCAGCA GGCTGGTTGC ACACGGGCGA CGTCGGCACG
ATGGACGAGC GCGGTTATGT GCGCATCACC GACCGGATGA AGGATCTCTA CATCTCGGGC
GGGTTCAATG TGTACCCGGC GGAAGTCGAA AAGCTGCTGG CCGAGCATCC GGCCATCGGA
ATGGCTGCGG TCGTCGGCGT TCCGGACGAG CGACTGGGCG AGGTGGGGCG CGCCTTCGTG
GTCCTGCGGC CCGGCGCGAG TGCGACCGAG GCCGAACTTG TCGCCTGGTC GCGCGAGAAC
ATGGCGAACT ACAAGGTGCC GCGCAGCTTT GTGCTGGTCG ATGACCTGCC CCGCAACGCG
TCGGGCAAGG TGCTGAAGAC CGAACTGCGG GCCGGATAA
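
The GC content field can, in principle, be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper name. It strips the spaces and line breaks used in the 10-base block layout before counting. Only the first block of the sequence is used in the example, so the printed value is not the whole-gene figure.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence, for illustration only.
print(gc_content("ATGCTCGATC"))  # 50.0
```

Passing the complete gene sequence to `gc_content` would give the whole-gene value to compare against the reported 66%.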
 
Protein sequence
MLDPLPATIP HAAQAAAATW PDAPSVLENG EVWSFAELWQ RSRAAASALI ARGIKAGDRV 
AIWAPNSREW IVAAIATMAC GAAVVTLNTR LKGREAGDIL RRTNARLLFT VEGFLGIDYR
ALIVDEDLPA LEGTVLLDRE FDAFMRDGRG AGDPAVDAAM AQIDADTVSD ILFTSGTTGS
PKGVLMTYGR VLPQAAVWCA NTRLTEGDRY LIANPFFHSF GMKVGWVACI LAGAVAVPML
QFDVGQAIDL IERERITFMP GPPTIFQMLL AELDKRKWDC SSLRGGTTGA ATVPPALVER
IRNDLGMVDL ITAYGMTECV NITTCVPGDD AETIARTCGK AFPGNEVRIA DENGNELPRG
EAGEVLVRGQ GVMLGYLDNP EATAEAIDAA GWLHTGDVGT MDERGYVRIT DRMKDLYISG
GFNVYPAEVE KLLAEHPAIG MAAVVGVPDE RLGEVGRAFV VLRPGASATE AELVAWSREN
MANYKVPRSF VLVDDLPRNA SGKVLKTELR AG
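
The protein sequence follows from the gene sequence by codon translation. Translation table 11 (the bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so a standard codon table is enough for a spot check. The sketch below is a hand-rolled illustration rather than a library call; `CODON_TABLE` and `translate` are hypothetical names, and it assumes the input contains only A, C, G and T.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGCTCGATC CGCTGCCCGC CACCATACCC"))  # MLDPLPATIP
print(translate("GCCGGATAA"))                         # AG
```

The two outputs match the first ten residues and the last two residues of the protein sequence.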