Gene Ppha_0502 details


Gene Information

Locus tag: Ppha_0502
Symbol:
ID: 6462030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 492492
End bp: 494624
Gene Length: 2133 bp
Protein Length: 710 aa
Translation table: 11
GC content: 59%
IMG OID: 642726790
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_002017445
Protein GI: 194335651
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
        [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
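
The coordinates and lengths listed above are consistent with one another. A minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the database record):

```python
# Values copied from the Gene Information table.
start_bp = 492492
end_bp = 494624

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 2133 bp
print(protein_length, "aa")   # 710 aa
```

Both results match the Gene Length and Protein Length fields.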


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid Coverage information is not available for this gene:

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGGGAG AGAGCCTTTT CGAACGACTC GCCGCCACCC TTGCAACCGA GGGCACCCTC 
TGGCTCGAAT CCGCCTTCTG TAACGAGCCG GAGGGCGGAG CGCTCCTCTT CTCTGACCCG
CTGGAGGTGG TGACCCTTCC CACACTCAAC GATATCGAGC TTTTTTTCCG GAGGATCGAG
GAACAGAGCG CCGCAGGTTT TCATCTTGCC GGCTGGCTCA CCTATGAGGC CGGTTACGCC
TTTGAAGAGA GGCTTCTTCA CGGCCTCTCA CCCGACGACC CTCATCTTCC CCCCTCGCCC
CTTGGCTGGT TTGGAGTCTA CCGGGAACCG GAGCGCTTTT CAGCCGCAGA GGTTCAGGAG
CTTTTCGCCG AACCGGGCCC TCGCTCCGAC CTCATCATCA CCGACCTCTC CTTCAATCTC
TCCGAAGAGG AATACGGCAG AAAAATTGAA CGCATCAAAG AGCAGATAGC AGCAGGCAAT
CTCTATCAGG TGAACTTTAC CGGACGCTAC CGCTTCACCA GCAACAGCGA GCCAACGGCG
CTCTTTGCCG CGCTGCGGGG TCGGCAACCC TCATCCTACA CCGCCTGTAT CAACAGCGGC
GGGCGCACCA TCCTCTCCTG CTCTCCCGAA CTCTTTTTCA GGCGGCGCGG CTCACTCATC
GAAACCATGC CGATGAAAGG AACCGCACCG AGAGGCCGCA CCATCGAAGA GGACAACCGC
CTCAGGGAGG GGCTTGCCGG ATGCCAGAAA AACCGGGCCG AAAATCTCAT GATCGTCGAT
CTCCTCCGTA ACGACCTCGG TCGGATATGC AAGCCCGGCA CAGTCGAAGC AGGGGATCTC
TTCACCATCG AAACATGGCC GACACTCCAC CAACTGCTCT CAACCATCCG GGGAGAGCTG
CGGGAGGGAA TCAAGCTCAG CGAACTCTTC CGGGCCCTCT ATCCCTCAGG CTCCATTACC
GGTGCGCCCA AAATAAGCGC CATGAAGCTC ATACAGAGCC TTGAACCAAC GTCAAGAGGG
ATCTATACCG GCACCATCGG CTATATCACC CCCGACAGCG AGATGGTTTT CAGTGTCGCA
ATCCGCACCA TCGAGCTATC CGGAAAACAG GGCACCTACG GATCAGGAGG CGGCATCGTA
TGGGACTCCG ATCCCCAGGA TGAGTACCGC GAGTGCCAGC TCAAGGCCAA GATCCTTACC
AACCCCGGCA GGAGCCGCAG CGCTGCAGCA ACAGAGACCG GCATTGCGCC AGGCAGCGAT
AGAGCAGCAC AGCAACCTGA AGAGCACCGC ATTCAGCTCA TTGCACCTGC ATCAGCAGAG
GACGTTGTTC TGGGCTGTAA CTTCGGTCAA ACGGACGCCC ATCATCAGCC TCCGGGTGGG
GTTTCCGGCA GCGAAGCAGG GCAACACTTC GGGCTTTTTG AAAGCCTGCT CTGGAACGGC
AACTATCTCT GGCTCGACGA ACACCTCCAG CGCCTTGCCG CATCAGCCGC CACGCTCGGC
TTTCCATGCG ACACCGCCGC TGCCACCCAC CTGCTTCACC AGCTTGAAGA GGAGATGCGC
CATCATGGCA GCCAGAACAG CAACCCGCAA CCCGGCCAGC AGCAGGAACA TCTCCGCACC
GAAGAACAGT GCCGGGGTGC CGCACGCTGC AAGGTACGCC TCAGCCTCAC AAGAGAGGGA
AGCTGCAGCG CCAGTTACGA GCCCATAACG GTGCAAGGCT CCACCACACC GCTCCGTTTA
TGCATTGCCG CAGAGCCCAC CCATTCATCC AACCCCCTGC TCCGGCACAA AACCACCAAA
AGGGAGCTTT ACGACCACTA CTTCACCCTT GCCCGCCAGC AAGGCTACGA CGAAATCCTC
TTCCACAACG AACGCGGCGA GATCACCGAA GGCGCCATCA GCACCATCTT TATCCGCAAA
GGCCAGCAAC TCTGCACCCC GCCACTCCAC TGCGGCCTGC TCAACGGCAT TTTCCGCCAT
TACATCCTCG CCACCCGCCC CACCGCAACC GAAAAAATCA TCACCATCAA CGACCTCGCC
ACCGCCGACG CCATATTCAT CGCAAATTCA GTCCGAGGCC TCAGACCCGC CACAATGTGT
AAAAACCCGC CGACCATGGG ACCTGATACA TAA
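
The GC content field can be recomputed from a gene sequence laid out in blocks of ten bases as above. A minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper, not a function provided by the database:

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used to lay out the
    sequence in blocks of ten bases) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage: paste the full gene sequence between the quotes.
example = "GTGAAGGGAG AGAGCCTTTT"
print(f"{gc_content(example):.0%}")
```

Applied to the full gene sequence, the result can be compared against the 59% reported in the Gene Information table.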
 
Protein sequence
MKGESLFERL AATLATEGTL WLESAFCNEP EGGALLFSDP LEVVTLPTLN DIELFFRRIE 
EQSAAGFHLA GWLTYEAGYA FEERLLHGLS PDDPHLPPSP LGWFGVYREP ERFSAAEVQE
LFAEPGPRSD LIITDLSFNL SEEEYGRKIE RIKEQIAAGN LYQVNFTGRY RFTSNSEPTA
LFAALRGRQP SSYTACINSG GRTILSCSPE LFFRRRGSLI ETMPMKGTAP RGRTIEEDNR
LREGLAGCQK NRAENLMIVD LLRNDLGRIC KPGTVEAGDL FTIETWPTLH QLLSTIRGEL
REGIKLSELF RALYPSGSIT GAPKISAMKL IQSLEPTSRG IYTGTIGYIT PDSEMVFSVA
IRTIELSGKQ GTYGSGGGIV WDSDPQDEYR ECQLKAKILT NPGRSRSAAA TETGIAPGSD
RAAQQPEEHR IQLIAPASAE DVVLGCNFGQ TDAHHQPPGG VSGSEAGQHF GLFESLLWNG
NYLWLDEHLQ RLAASAATLG FPCDTAAATH LLHQLEEEMR HHGSQNSNPQ PGQQQEHLRT
EEQCRGAARC KVRLSLTREG SCSASYEPIT VQGSTTPLRL CIAAEPTHSS NPLLRHKTTK
RELYDHYFTL ARQQGYDEIL FHNERGEITE GAISTIFIRK GQQLCTPPLH CGLLNGIFRH
YILATRPTAT EKIITINDLA TADAIFIANS VRGLRPATMC KNPPTMGPDT
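
The protein sequence corresponds to the gene sequence read under translation table 11. Table 11 assigns the same amino acids as the standard genetic code but permits alternative start codons, which is why the leading GTG appears as M in the protein. A minimal sketch of that translation, assuming the start-codon set NCBI lists for table 11; `translate` and the constant names are hypothetical and written only for illustration:

```python
from itertools import product

# Codons ordered TTT, TTC, TTA, TTG, TCT, ... with the matching amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is read as methionine regardless of its usual meaning.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGAAGGGAG AGAGCCTTTT CGAACGACTC"))  # MKGESLFERL
```

The output matches the first ten residues of the protein sequence listed above.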