Gene Daro_3481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3481 
Symbol 
ID3566941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3729507 
End bp3730982 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content62% 
IMG OID637681953 
Productanthranilate synthase component I 
Protein accessionYP_286680 
Protein GI71909093 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.219559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAA CCGAATTCAA TTCGCTTGCC GCGCAAGGCT ACAACCGTAT TCCCGTCACG 
CTGGAAACGT TTGCCGATCT CGACACGCCG CTCTCCATCT ACCTGAAGCT GGCCAATGCG
CCCTACACCT ACTTGCTCGA ATCGGTGCAG GGCGGTGAGC GCTTCGGTCG CTACTCGATC
ATCGGTCTGG CTGCCCAGAC GCGTATCGTC GTCAATGGCC ACCAGGTGCT GGTGCTGACC
GGCAACCGTA TCGCCGAGCG TGAAAACGAC ACCAACCCGC TGGAATTCAT CGGCAAGTTC
ATGCAGCGCT TCCGGGCGCC GCCGGCGAAC GGCCTGCCAC GCTTCTGCGG CGGCCTGGTC
GGTTGCTTCG GCTACGACAC CGTGCGTTAC GTCGAAACCC GCCTGACCCG CACCAACAAG
CCGGACGAAA TCGGCACGCC GGACATCGGC CTATTGCTTT CCGAAGAAAT CGCTGTCGTC
GACAACCTGT CCGGCAAGCT GACGCTGATC GTCTATGCCG AGCCCGGCTT CCCCGGTGCC
TATCAGAAGG CCCGCGCCCG TCTCAAGGAA TTGCTGCAGA AACTGCGCAC GCCGGTCTCC
CTGCCCAGCG AGCAGCCGGT CCATTCGGAA GCGGCCGTTT CCGTCTTTGG CGAAGCGGCC
TTCAAGCAGG CTGTGCTCAA GGCCAAGGCC TACATCACCG AAGGTGACAT CATGCAGGTC
GTGCTGTCGC AGCGCATGAC CAAGCCCTTC CTGGCGAGCC CTCTGGCGCT CTATCGCACC
CTGCGCAGCC TGAATCCGTC GCCCTACATG TTCTACTTCG ACTTCGAGGA TTTCCACGTG
GTCGGCGCCT CGCCGGAGAT TCTGGTTCGC CTCGAAGGCG AGCGCGTCAC GGTTCGGCCG
ATTGCCGGCA CCCGCAAGCG CGGTGCTTCG CCGGAAGAGG ATGCAGCTCT GGCCGTCGAA
CTGCTGGCCG ATGAAAAAGA ACGGGCCGAA CATACCCAGT TGCTCGACCT CGGCCGCAAC
GACTGCGGGC GTGTCGCGCG TGTCGGTTCG GTCAAGCTGA CCGAAAACAT GATCGTCGAG
CGTTATTCGC ACGTGATGCA TATCGTTTCC AATGTCGAGG GCAAGCTGCA GCCGGGTCTG
GATGCACTTG ACGTGCTGCG CGCCACCTTC CCGGCCGGCA CCGTCTCCGG TGCGCCCAAG
GTGCGGGCGA TGGAAATCAT CGACGAACTG GAACCGGTCA AGCGTGGCAT CTACGCCGGT
TCGGTCGGCT ATCTCGGTTT CAACGGCGAC ATGGATGTGG CCATTGCCAT CCGCACGGCT
GTGCTCAAGG ACAAGAAGCT CTATGTGCAG GCCGGTGCCG GGATCGTCGC CGATTCCGAT
CCGAATTCGG AATGGACCGA AACCCTGAAC AAGGCGCGTG CCGTGCTGCG TGCGGCCGAA
CTGGCCGAGC AGGGTCTGGA TACAAGGATC GACTGA
 
Protein sequence
MTETEFNSLA AQGYNRIPVT LETFADLDTP LSIYLKLANA PYTYLLESVQ GGERFGRYSI 
IGLAAQTRIV VNGHQVLVLT GNRIAEREND TNPLEFIGKF MQRFRAPPAN GLPRFCGGLV
GCFGYDTVRY VETRLTRTNK PDEIGTPDIG LLLSEEIAVV DNLSGKLTLI VYAEPGFPGA
YQKARARLKE LLQKLRTPVS LPSEQPVHSE AAVSVFGEAA FKQAVLKAKA YITEGDIMQV
VLSQRMTKPF LASPLALYRT LRSLNPSPYM FYFDFEDFHV VGASPEILVR LEGERVTVRP
IAGTRKRGAS PEEDAALAVE LLADEKERAE HTQLLDLGRN DCGRVARVGS VKLTENMIVE
RYSHVMHIVS NVEGKLQPGL DALDVLRATF PAGTVSGAPK VRAMEIIDEL EPVKRGIYAG
SVGYLGFNGD MDVAIAIRTA VLKDKKLYVQ AGAGIVADSD PNSEWTETLN KARAVLRAAE
LAEQGLDTRI D