Gene Saro_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0445 
Symbol 
ID3918313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp487393 
End bp489180 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content69% 
IMG OID640443174 
Productpara-aminobenzoate synthase, component I 
Protein accessionYP_495727 
Protein GI87198470 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.59777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGTG CTCCCTTCAT CCTGCTCGAC GACGCCCGCA GCGAGGGCGC GAGTGATGCG 
CGCCTCTACG AGGACGCCGT CGAGATCGTC GTCGCGCGCC GCGCCGAGGA GGTCGAGGCC
GCGCTTGAAC GGATCGCGGC TGTCCCCGGC AACTGGGCAG GCTACCTCGC CTACGAGGCG
GGCCTCGCGC TCGAACCGCG TCTCTTGCCG CTGGCGGCGG TGCGAACCGG TGCTGACGGC
CCGTTGGTGT GGTTCGCGCG CTTCGACCGC GTCACCCGGA TGCCGAGCGC GGAGGTCGAG
CAGTGGCTGG TGCGCAATGC GCGCGGCCCC GGGCGCCTCG GTCCGCTCGG CCCTGCGCTG
TCGTCGGGCG GCTATGCCCG CGCCTTCGAC CTGGTGCAGG AGGCGATCCG CGCGGGCGAC
ATCTACCAGG CGAACCTGAC CTTCCCGCTG ACCGGCACCT GGGATGGCGA TCCGCTGTCG
ATCTACGGCG AAGTGCGCCG CGATGCGCGC GCGGGCTATG GCGGCGTGAT ATGGGACGGC
GCGCATTGGC ACCTCTCGTT CTCGCCCGAA CTCTTCTTCA AGCTTTCCGG CGGAGAAGCG
CGGGTGCGCC CGATGAAGGG CACTGCCCCG CGCGGGGCAA CCCCCGAAGA GGACGCGCGC
TTCAAGGCCG ACCTCTCCGC CAGTGTAAAA GACCGCGCCG AAAACCTGAT GATCGTCGAT
CTCATGCGAA ACGATCTCGC GCGCGTTTCA CAGGCCGGGT CGGTCAAGGT CGAGGCGCCC
TTCGCCATCG AATCCTATCC TACCGTCCAC CAGATGGTCA CCACCGTGCG CGCCAGGCTT
GCCCCCGGCG AGGACGTGCG CAGCCTCCTG CGCGCGATCT TCCCCTGCGG TTCGATCACC
GGAGCGCCCA AGATCCGCGC GATGGAGATC ATCCATGCCG CAGAACGGGA CGCGCGCGGG
CTCTATTGCG GCTCGATCGG ACGGATCGAC CCCACTGGCG AGGCGGCGTT CAATGTGGCA
ATCCGCACGC TGCGTCTTTC GAGCGATCAC GCCGGGGCCT CCGGGGGCCG CGCGGTCATG
GGCGTCGGCT CTGCGGTCGT TGCCGATTCG GCCATGATCG GCGAATGGCG GGAATGCGTG
GTGAAGGGGA ATTTCTTGCG TCTGTCCGCC GGCCATGCCG ACCTTATCGA GACCATGGCC
TTCGATCCGG CCAAGGGTAT CGAACTGCTC GAACTGCACC TCGAACGCAT GAAGGCCAGT
GCGCTTGCGC TAGGCTACAG CTTCGACCGG CACGCGGTGC GCAATGCGAT CCATGCGCTT
TGCTTCGATC TCGACGCGCC CTCGCGGGTC CGGCTCGTCG TGTCGAAGGG GGGCGCCCAT
GCGCTCGATG CCTCGGCCAT GCCGGCGCCG ATCGAAAGCC TGACCTGCGC GGTGCTCTCG
CTTCCGGTGT CGGACGGCGA CTGGCGCCTG CGCCACAAGA CCACGGACCG CGCCTTCTAC
GAGGCGGGCC TGTCCGCCGC GAAACGGGCG GGCGCCGGTG AGGCCCTGTT CCTGCGCGAC
GATGGCTTCC TTACCGAAGG CACGTTCACC AACATCTTCG TCGAGCGCGA TGGCATTCTG
CTCACCCCCC GCGCCGAACT CGGCCTGCTT CCCGGCGTAC TGCGCCGAAG CCTGATCGAG
GCCGGACGTG CGGTCGAGGC GGACCTGACG CTCGATGACC TTGCCGGCGG CTTCCTCGTC
GGCAACGCGC TGCGCGGCCT CATCCCGGCT CGCCTGCTTG GCGCCTGA
 
Protein sequence
MTRAPFILLD DARSEGASDA RLYEDAVEIV VARRAEEVEA ALERIAAVPG NWAGYLAYEA 
GLALEPRLLP LAAVRTGADG PLVWFARFDR VTRMPSAEVE QWLVRNARGP GRLGPLGPAL
SSGGYARAFD LVQEAIRAGD IYQANLTFPL TGTWDGDPLS IYGEVRRDAR AGYGGVIWDG
AHWHLSFSPE LFFKLSGGEA RVRPMKGTAP RGATPEEDAR FKADLSASVK DRAENLMIVD
LMRNDLARVS QAGSVKVEAP FAIESYPTVH QMVTTVRARL APGEDVRSLL RAIFPCGSIT
GAPKIRAMEI IHAAERDARG LYCGSIGRID PTGEAAFNVA IRTLRLSSDH AGASGGRAVM
GVGSAVVADS AMIGEWRECV VKGNFLRLSA GHADLIETMA FDPAKGIELL ELHLERMKAS
ALALGYSFDR HAVRNAIHAL CFDLDAPSRV RLVVSKGGAH ALDASAMPAP IESLTCAVLS
LPVSDGDWRL RHKTTDRAFY EAGLSAAKRA GAGEALFLRD DGFLTEGTFT NIFVERDGIL
LTPRAELGLL PGVLRRSLIE AGRAVEADLT LDDLAGGFLV GNALRGLIPA RLLGA