Gene Daro_3087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3087 
Symbol 
ID3566516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3327610 
End bp3329277 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content60% 
IMG OID637681558 
ProductPara-aminobenzoate synthase, component I 
Protein accessionYP_286287 
Protein GI71908700 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value0.972054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.655519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTGAAT GCCTCGAGGT CCGCGATAAC GCCTCGCTTG CAGAGGCATT GCAACGTCTG 
GAAGATGAGC CAGCGTGGTC GGTCGCCGCT CTCGACTACG AATTGGGTTA CCTGTTGGAG
CCAAAGTCTG CGCCGCCAGA CTGGGTGCCA GGCGAGAAGC CGCTGGCCCG TTTCTGGCGA
TTCGCCAGAC GGCAGCCCCT GATGGCCGAT GCCGTCGAGG CTTGGCTGAA CAAACAGACG
GGCAAGGCGA TTGCAGGTGT GGGTGACCTG CATCCCATCA TTAGCGAACA ACATTATGTT
GCTGCGGTGG AGCGGATCAA GCAGTTGATA TTTGCTGGCG ACTGCTACCA GGTAAATCTT
ACCTTCCCGC TCGAATTCGA CTGGTTTGGT TCGCCGCTGG CGCTCTATGC CAGATTGCGC
GAACGACAGC CGGTACGTTA CGGTGGTTTT GTCGGGGATG CTTCACAGGG CCTGGTTTCC
CTTTCCCCCG AGCTCTTTCT CGAGCGCCTG GGCGACAGGT TGCTGACCCG GCCGATGAAG
GGGACGGCGC CACGTGGCGT GCCCGCCGAG CAACTGCGCA ATTCCGCAAA GGATTGTGCC
GAAAACCTGA TGATCGTGGA TTTGCTGCGT AACGATCTTG GCCGGGTGGC TGACAACGGT
AGCGTGGTGG TAGATCGTCT ATTCGAGATT GAGGATTACC CGACAGTCTG GCAGATGGTC
TCCGAGGTCT CTGCACGCGC AGGTCGCAGT AGTTTTGGCG AAATACTGCG GGCGCTTTTC
CCATGTGGAT CAATAACGGG TGCGCCAAAG ATTCGTGCCA TGCAGATTGC CGCCGAACTT
GAAGGCGCGC TGCGGGGCAT CTATACCGGG GCTTTTGGCT GGATCGCGCC GGATGGCGAC
TTTCGATTGA ACGTCGCGAT TCGAACGCTG GCGTTGCAAG CGGGTGGTCG GGGCAAGCTG
GGCATCGGCA GCGGTATCGT GGCCGATTCT CAGCCGCTGG CGGAGTGGCA GGAGTGTCAG
CTCAAGGCCA GGTTCCTGCG TGAATGCGAT CCGGGCGTGT TGCTGATCGA GACGCTGCGT
CGCGAGGAAG GCAGTTATCC GCGTCTGGCT GGGCATCTGG CTCGCCTGCG CCAGTCGGCG
GCTTGGCTGG GCTTTCCTTA TGACGAACAG CGAGTCAAGG ATCTACTGGC CGAACAGCCG
GAGATCGGCT TCTGGCGGGT GCGTCTGACG CTGGCTAAAG ATGGCCGGCT AGATGTGCAG
TCGTTTCCTC TGGCGGGTGA GCCTGATGTT CTACGCCAAG CATTGTTGGC ACCGGAGCCG
ATTTGTTCGA CTTACCCCTT GCGCCGACAC AAGACGACTG ATCGTGCGGT ATACGATGAA
GCGATCCGTG CTCTGGCGGG CGATCAGCAA TTGTTCGACG TGGTTTTTCT GAACGAGCGT
GGTGAGGTCG CCGAAGGGGC GCGTAGCAAC GTATTTGTCG AACGGGATGG CGTGTTCCTG
ACGCCACCTC TGGCGAGTGG CGCCTTGCCC GGTGTGTTGC GGGCTGAACT GCTGGCCGAT
GGGCGGGCAC GTGAAGCCGT CTTGTGGCCG GAAGATCTCG ACAGGGGATT CTACTTGGGC
AATGCCTTGC GCGGCCTGAT TTGGGTATCC CTACGGACCG ACGCCTGA
 
Protein sequence
MIECLEVRDN ASLAEALQRL EDEPAWSVAA LDYELGYLLE PKSAPPDWVP GEKPLARFWR 
FARRQPLMAD AVEAWLNKQT GKAIAGVGDL HPIISEQHYV AAVERIKQLI FAGDCYQVNL
TFPLEFDWFG SPLALYARLR ERQPVRYGGF VGDASQGLVS LSPELFLERL GDRLLTRPMK
GTAPRGVPAE QLRNSAKDCA ENLMIVDLLR NDLGRVADNG SVVVDRLFEI EDYPTVWQMV
SEVSARAGRS SFGEILRALF PCGSITGAPK IRAMQIAAEL EGALRGIYTG AFGWIAPDGD
FRLNVAIRTL ALQAGGRGKL GIGSGIVADS QPLAEWQECQ LKARFLRECD PGVLLIETLR
REEGSYPRLA GHLARLRQSA AWLGFPYDEQ RVKDLLAEQP EIGFWRVRLT LAKDGRLDVQ
SFPLAGEPDV LRQALLAPEP ICSTYPLRRH KTTDRAVYDE AIRALAGDQQ LFDVVFLNER
GEVAEGARSN VFVERDGVFL TPPLASGALP GVLRAELLAD GRAREAVLWP EDLDRGFYLG
NALRGLIWVS LRTDA