Gene Saro_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2021 
Symbol 
ID3917342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2154555 
End bp2156090 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content69% 
IMG OID640444773 
Productanthranilate synthase, component I 
Protein accessionYP_497294 
Protein GI87200037 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0405957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACGA TCGACGCATC CGCCGTCTCG CTGCCGGAAA ACCACCGCGC CGCGCTTTCC 
CAGCTTTCCG CCGGAAAGCC GGCGCTGGTC TGGCGCAAGC TGATCGTCGA TACCGAGACT
CCCGTCGGCG CCGCGCTCAA GCTGATGGAA AGCGGTCGCG GCGATTTCCT GCTCGAATCC
GTGCAGGGCG GCGAAGTACG CGGGCGCTAC AGCCTGCTCG GGCTCGATCC CGATCTCGTC
TTCCGCGCCA CCGGCTCGTC GGCAGAGATC AACCGCATCT GGCGGCACGA CAAGGCAGCC
TTCGCACCGC TACCCGGCGA TGCCCTCGCC GAACTGCGCG CGCTCGTCGC CTCCTGCCGC
ATCGACGTCC CGGCCGAGCT GCCCTCGGCG CTCGCCTGCC TCGTCGGCTA CTTCGGCTAC
GAGACCATCG GCCTGGTCGA GAAGCTGCCC CGCGCACCGC AGAGCGAGCT TGTCCTGCCC
GACATGCTGT TCACCCGCCC GACCGTGGTG CTGGTGTTCG ACCGCCTGTC CGACGAACTC
TTCGCCATCG CCCCGGTCTG GGCCGAAGGC GGCGATCCGG CGCGCCTGCT CGAAGCCGCG
GCGGAGCGCA TCGACAATGC CCTGCGCCGG CTGTCCGATC CGGTCCCCGC CGATGCGCGC
CTTGCCGAAG CGGTCGACGT CACGCCGCAG CCAGTCATGG CCGCACCCGA CTATGCGCGT
ATGGTGACTG CCGCCAAGGA CTACATCGAG GCGGGCGACA TCTTCCAGGT CGTCCTCGCC
CAGCGCTTCA CCGCGCCCTT CCCGCTGCCG CCCATCGCGC TCTACCGTTC GCTGCGCCGC
ATCAATCCCT CGCCGTTCCT CTACTTCCTC GACATGCCGG GCTTTGCGCT CACCGGCTCC
TCGCCGGAAA TCCTGGTCCG CATCCGCGAC GGCGAAGTCA CGATCCGCCC GATTGCCGGC
ACCCGCCCGC GCGGGCGCAC CGCCGAGGAA GACCGGGCCA ACGAAGAGAG CCTGCTGGCC
GATCCCAAGG AACGCGCCGA ACACCTCATG CTGCTCGACC TCGGCCGCAA CGACGTCGGC
CGCGTGGCCA GGGCCGGCAC CGTGAAAGTC ACCGAAAGCT ACACGGTCGA ACGCTACAGC
CACGTGATGC ACATCGTCTC GAACGTGGTC GGCCAGCTCG ACACGAACCG CGCCGACAGC
GTCGACGCCC TCTTCGCCGG GTTCCCCGCC GGCACAGTCT CGGGCGCACC CAAAGTCCGC
GCCTGCGAGA TCATCGCCGA ACTCGAACCC GAGACGCGCG GCGCCTACGC TGGCGGTGTC
GGCTATTTCG CGCCCGACGG CTCTGTCGAT AGCTGCATCG TCCTCAGGAC CGGCATCCTC
AAGGACGGCG TCCTCCATGT CCAGGCTGGC GCCGGCATCG TCGCCGACAG CGACCCCGCC
TACGAACAGC GCGAATGCGA AGCCAAGAGC GGCGCCCTCT TCGCCGCCGC GCGCGAAGCC
GTCCGTGTCG CCACAGAACC GAAGTTTGGC CAATGA
 
Protein sequence
MTTIDASAVS LPENHRAALS QLSAGKPALV WRKLIVDTET PVGAALKLME SGRGDFLLES 
VQGGEVRGRY SLLGLDPDLV FRATGSSAEI NRIWRHDKAA FAPLPGDALA ELRALVASCR
IDVPAELPSA LACLVGYFGY ETIGLVEKLP RAPQSELVLP DMLFTRPTVV LVFDRLSDEL
FAIAPVWAEG GDPARLLEAA AERIDNALRR LSDPVPADAR LAEAVDVTPQ PVMAAPDYAR
MVTAAKDYIE AGDIFQVVLA QRFTAPFPLP PIALYRSLRR INPSPFLYFL DMPGFALTGS
SPEILVRIRD GEVTIRPIAG TRPRGRTAEE DRANEESLLA DPKERAEHLM LLDLGRNDVG
RVARAGTVKV TESYTVERYS HVMHIVSNVV GQLDTNRADS VDALFAGFPA GTVSGAPKVR
ACEIIAELEP ETRGAYAGGV GYFAPDGSVD SCIVLRTGIL KDGVLHVQAG AGIVADSDPA
YEQRECEAKS GALFAAAREA VRVATEPKFG Q