Gene Paes_0455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0455 
Symbol 
ID6460783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp488100 
End bp489956 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content53% 
IMG OID642724454 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_002015158 
Protein GI194333298 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.68845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGT CTCATGATCG TATCCTGACT TCTCTTCCGC CAAGCACGCT CTGGTTCGGG 
GGGTTGTTTT CCAGCAGCAG TGCGTCTCGT GGGCTCTGTT TCGAAGATCC GCTTGAATGT
CTTTCGCTTA CTTCATCAGA GGAGACAGAT GCATATTTTT CTCTGCTTCA GGAGCGGCTT
GAAGAGGGAT ATGCGCTTGC CGGTTATGTC GGTTATGAGG CCGGTTATGG GTTTGAACCG
ACGGCATTCA ACCCTGTTGA AGAGCTAACC CGGGATGAAC TTCCTCTTGC CTGGTTCGGC
GTCTATCACG CTCTGCGAGA GTGCGACTTC GGGGAGATAA ACCCTTCGCA TACGTCAGGA
TGTGCTCCCG GGTTTGATAT GACCCTTGAC GCATATCGTA CGAAAATAGA GATGATCCTC
GATCGCATTG CCGCGGGGGA TGTTTACCAG GTGAATTTTA CCGGACGCTA TTCGTTTGAT
TTTGACGGCC CGCCTGACAC GCTTTTTCAC GCACTTTCGG CAAGACAGCC ACAATCCTAC
AGGGCCTGGC TCAATATCGA TGGGCATCAG ATCATGTCGT TTTCCCCTGA GCTGTTTTTT
TCCAGAAAGG GGCGTTGCAT ACACACGTGT CCGATGAAGG GGACTGCGCC CAGGGGAAAG
AGTGTCGAGG AGGACGAAGA ACTCCGTGAA GGACTGCGGC ATTCTGAAAA GAACCGTGCA
GAAAATCTTA TGATCGTTGA TCTTCTGCGC AATGATCTTG GCCGAATCTG CACTCCCGGA
AGCGTGACAG TTCCGGAGCT TTTCACGATA AGAAGCTACC CGACACTGCA CCAGATGGTC
TCTTCTGTCG ATGGAGAGCT TGAAGGTGAG TGTGATCTCA GGGGGTTGTT TCGCGCGATA
TTTCCCTGCG GTTCAGTTAC CGGTGCGCCG AAAATAAGGG CGATGCAGCT TATCCGGCAA
CTTGAACAAT CTCCACGGGG GGTCTATACC GGTGCGGTCG GGTTCATGCT GCCGGACATG
ACAATGGAGT TCAATGTCGC TATCCGAACG CTGACGCTCA ACCAGGGGCA TGGAACCTAT
GGGGCTGGCA GCGGGATTGT CTGGGATTCG GATGCCGATG ATGAGTTTGG CGAATGTGGA
CTGAAAGCCG GGATTCTTCT GGATCACTCT CCTGAAACGC TGAGTCTTTT CGAAACATTG
CTCTGGAACG GCAGCTATCT ATGGCTGGAA GAGCATCTTG ATCGCCTTCG CCGTTCTGCC
GAAACGCTTG GATTTTCCTG TTCGCGCAAG GCGATACGAT CAAAACTCGA TGCCTATGCA
TCTGCCCGCC TTCAGCGCCA GGGGACAAGC CGGGTCAGAG TTGTTCTGGA GAGGGACGGT
CGTTTTTCTG TTTCTTCAGA GCCGCTCGGT CGGGAGGCAT TTCTGTCTCC AGTGAAGGTC
TGTTTCGCCG CTCCGTTCAC CGATTCGAAG GATCCGATGC TCGGCCATAA AACAACGGCA
AGGCATCTCT ATGACCGGAT TTTGCGGCAG GCTGCTGCTG CAGGTTTTGA TGAGGTGCTT
TTCTGTAATG AAAGAGGGGA GATCACGGAG GGCGCGATCA GTAATGTGAT CATTATGAAA
GACGGACGGT ATTATACCCC GCCGCTTTCT TGCGGTATGC TTCCAGGTAT CTACCGCAGT
TATTTTCTGG CGACAAGGTC GAACGCTGTG GAAAAGGTAC TCTATCCGGA AGATCTTCTT
TCGGCCGATG CGGTCTTTGT CTGTAATTCA CTTCGCGCAA TGCGCCGTTC GATCCTTTTT
CCTTCGATGC TCATCGGTGA TGAATGCGCT GCAGCGGTTG ATGATCATAT GAAATAG
 
Protein sequence
MKASHDRILT SLPPSTLWFG GLFSSSSASR GLCFEDPLEC LSLTSSEETD AYFSLLQERL 
EEGYALAGYV GYEAGYGFEP TAFNPVEELT RDELPLAWFG VYHALRECDF GEINPSHTSG
CAPGFDMTLD AYRTKIEMIL DRIAAGDVYQ VNFTGRYSFD FDGPPDTLFH ALSARQPQSY
RAWLNIDGHQ IMSFSPELFF SRKGRCIHTC PMKGTAPRGK SVEEDEELRE GLRHSEKNRA
ENLMIVDLLR NDLGRICTPG SVTVPELFTI RSYPTLHQMV SSVDGELEGE CDLRGLFRAI
FPCGSVTGAP KIRAMQLIRQ LEQSPRGVYT GAVGFMLPDM TMEFNVAIRT LTLNQGHGTY
GAGSGIVWDS DADDEFGECG LKAGILLDHS PETLSLFETL LWNGSYLWLE EHLDRLRRSA
ETLGFSCSRK AIRSKLDAYA SARLQRQGTS RVRVVLERDG RFSVSSEPLG REAFLSPVKV
CFAAPFTDSK DPMLGHKTTA RHLYDRILRQ AAAAGFDEVL FCNERGEITE GAISNVIIMK
DGRYYTPPLS CGMLPGIYRS YFLATRSNAV EKVLYPEDLL SADAVFVCNS LRAMRRSILF
PSMLIGDECA AAVDDHMK