Gene Anae109_1629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1629 
Symbol 
ID5374283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1832112 
End bp1833773 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content77% 
IMG OID640843138 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001378817 
Protein GI153004492 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.810406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.772192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACCG ACGACCCCGC CCGCGTCCGG GCCGCCCTCG CGGAGGTCGA AGGCGAGGCG 
CGCCGCGGTC GATGGGCGGC TGGGTACGTG GCCTACGAGG CTGCGACCGG GCTCGAGCCC
GCGCTCGCCG TCCGCGGCCG CTCGGGACCG CTCTTGTGGT TCGGCATCCA CGACGCGCCG
GCGAACCCAT CGGCGCCGGC CGCGGGGGCG ATCGCGGGAG CGCGCGTCGG GGCGCTCGCG
CCGGAGGTCA CGCGCGCAGA GCACGTCGCC GGAGTGGAGA CGGTGCGCGC CGCGCTGGGA
CGCGGGGACG CCTACCAGGT GAACCTGACC TTCCGCATGC GCGGGAGCTT CGACGGCGAT
CCCTTCGCGC TGCACGAGCG GCTCCGCGGC GCGCAGGGCG GCGGGTACAC CGGCTGCCTC
GTCGTGGACG GGCGCGCGGT GGTGTCCGCG TCGCCCGAGC TGTTCTTCCT CCGGCGCGGA
GACGCGATCC TCGTCCGGCC GATGAAGGGG ACCGCCCGGC GCGGCCGGAC CCTCGCCGAG
GACGAGCGTG CGGCGAAGAC GCTGGCGGCC TCGCCGAAGG AGCGCGCCGA GAACGTCATG
ATCGTCGACC TGCTCCGCAA CGACCTCGGC CGCGTCGCGC GAACCGGCTC GGTGCGGGTG
GCCGAGCTGT TCACGGTCGA GCGCTACCGG ACGGTGCTGC AGCTCACCTC GACCGTCGAG
GCGCGCCTCG CTCCCGCGGT CGGCCTCGCC GAGCTCTTCG CGGCCCTGTT CCCGTGCGGC
TCGGTCACGG GGGCGCCGAA GATCGCGGCG ACGCGGATCA TCGCGGCGCT GGAGCGGAGC
CCGCGCGGCC CGTACTGCGG CGCCCTCGGC GTCGTGGCGC CGGGCGGCGA CGCGGTGTTC
AACGTGGCGA TCCGCACGCT CGACCTCGAC CTCGAGCGCG GCCTGGCGAC CTACGGCGTC
GGCGGCGGCA TCACCTGGGG CTCGGATCCC GGGCGCGAGT GGGACGAGGC GATGGCGAAG
GCCGAGGTGC TCGCCGAGCC GGCGGAGGAC CTCGAGCTGC TCGAGACCCT CCGCCTGGAG
CGGGGCGTCT ACGCGCGGCT CGACCGGCAC CTGGCCCGGC TCGAGGCGTC GGCGCGCTAC
TTCGGGATCC CCGTGGACCT CGCGGCGGTG CGAGCGGCCC TCGACGCGGA GGCGCGGAGC
GCGCCCGCCG AAGGGGCGCG CGTCCGGCTC CTCGTCGGGG CCGACGGACG GCCGCGGACG
GAGTCGGCGG CGCTCCCGGC GGCGTCGGCG GAGCCGCTGC CGGTGGCGCT CGCGCGGGCC
CCCGTCGATC GCGCGGATCG GCTCCTCTTC CACAAGACCA CGCGCCGCGC GGTGTACGAC
GCCCGGCGCG CCGAGCGGCC CGACGTCTTC GACGTGCTGC TCTCGAACCG CGAGGGCGAG
CTGACCGAGC TCACCATCGG CAACCTCGTC GTCGAGCTCG GCGGCGAGCG GCTCACCCCG
GCCCTCGACT CGGGCCTGCT CGCCGGGACC CTGCGCGCGG AGCTGCTCGA GCGGGGAGAG
GTTCGCGAGG CCGTGCTGCG CGTCGCCGAC CTCGAGCGCG CCGCGCGGCT GTGGCTCGTG
AACTCGCTGC GGGGGTGGGT GCCGCTCCGG CTGGTCCGGT GA
 
Protein sequence
MATDDPARVR AALAEVEGEA RRGRWAAGYV AYEAATGLEP ALAVRGRSGP LLWFGIHDAP 
ANPSAPAAGA IAGARVGALA PEVTRAEHVA GVETVRAALG RGDAYQVNLT FRMRGSFDGD
PFALHERLRG AQGGGYTGCL VVDGRAVVSA SPELFFLRRG DAILVRPMKG TARRGRTLAE
DERAAKTLAA SPKERAENVM IVDLLRNDLG RVARTGSVRV AELFTVERYR TVLQLTSTVE
ARLAPAVGLA ELFAALFPCG SVTGAPKIAA TRIIAALERS PRGPYCGALG VVAPGGDAVF
NVAIRTLDLD LERGLATYGV GGGITWGSDP GREWDEAMAK AEVLAEPAED LELLETLRLE
RGVYARLDRH LARLEASARY FGIPVDLAAV RAALDAEARS APAEGARVRL LVGADGRPRT
ESAALPAASA EPLPVALARA PVDRADRLLF HKTTRRAVYD ARRAERPDVF DVLLSNREGE
LTELTIGNLV VELGGERLTP ALDSGLLAGT LRAELLERGE VREAVLRVAD LERAARLWLV
NSLRGWVPLR LVR