Gene PA14_32160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_32160 
SymbolantA 
ID4380577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp2796513 
End bp2797907 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content67% 
IMG OID639325156 
Productanthranilate dioxygenase large subunit 
Protein accessionYP_790725 
Protein GI116050456 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03228] anthranilate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.243379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTA CCCGCAGAAG CCTGGAACAG TGGCGCGACT ATGTCGGCGG CTGTCTCGAC 
TTCCGTCCCG AGGAAGGCAT CTTCCGCATC GCCCGGGACA TGTTCACCGA ACCGGAGCTG
TTCGACCTGG AGATGGAGCT GATCTTCGAA AAGAACTGGA TCTACGCCTG CCACGAAAGC
GAACTGGCCA GGCCCCACGA CTTCGTCACC CTGCGCGCCG GACGCCAGCC GCTGATCGTC
ACCCGCGACG GCAACGGCCA GTTGCACGCG CTGGTCAACG CCTGCCAGCA TCGCGGCGCG
ACCCTGGTGC GGGTCGGCAA GGGCAACCAG TCGACCTTCA CCTGTCCGTT CCATGCCTGG
TGCTACAAGA ACGACGGCCG GCTGGTGAAG GTCAAGGCGC CGGGCGAATA CCCGGAAGGC
TTCGACAAGG CCACCCGCGG CCTGAAGAAA GCGCGCATCC AGAGCTACAG GGGCTTCGTC
TTCGTCAGCC TGGACGTCGC CGGCGAGGAC GACCTGGTGG ACTTCCTCGG CGACGCCCGG
GTGTTCCTCG ACATGCTGGT GGCGCAGTCT CCCAGCGGCG AGCTGGAAGT GCTGCCCGGC
ACCTCCACCT ACACCTACGA AGGCAACTGG AAGCTGCAGA ACGAGAATGG CCTGGACGGC
TATCACGTCA GTACCGTGCA CTACAACTAC GTAGCCACCG TGCAGCACCG CCAGCAGGTC
GAGGCCGAGC GCGGCGGCGC GGCCGCCACC CTCGACTACA GCAAGCTCGG CGCCGGCGAC
GCGGCCACCG ACGACGGCTG GTTCTCCTTC GCCAACGGCC ACAGCGTGCT CTTCAGCGAG
ATGCCCAACC CCGCCGTACG CCCCGGCTAC GCCAGCGTGA TGCCGCGGCT GGTGGCGGAA
TACGGCCAGG CCCGCGCCGA GTGGATGATG CATCGCCTGC GCAACCTCAA TCTCTACCCC
AGCCTGTTCG TCATCGACCA GATCAGCTCG CAGCTGCGCA TCGTCCGCCC GCTGGCCTGG
AACCGTACCG AGATCGTCAG CCAGTGCATC GGCGTCAAGG GCGAGTCGGA CGCCGACCGG
GAGAACCGGA TCCGCCAGTT CGAGGACTTC TTCAACGTCT CCGGGATGGG CACGCCCGAC
GATCTGGTGG AGTTCCGCGA AGCCCAGCGT GGCTTCCAGG CCCGCCTGGA GCGCTGGAGC
GACATCTCCC GCGGCCACGG CAAGTGGCTC GAAGGCGCGA CGCCGAACAG CCAGGCGCTG
GGTATCGCGC CGCTGCTGAC CGGCACCGAG ATCACCCACG AAGGCCTCTA CGTCAACCAG
CACGCGCATT GGCGGCGCTT CCTCCTCGAC GGCCTGGAGC GCCTGGCCCT GCGCGCGAAG
GAGGTGACCC CATGA
 
Protein sequence
MNATRRSLEQ WRDYVGGCLD FRPEEGIFRI ARDMFTEPEL FDLEMELIFE KNWIYACHES 
ELARPHDFVT LRAGRQPLIV TRDGNGQLHA LVNACQHRGA TLVRVGKGNQ STFTCPFHAW
CYKNDGRLVK VKAPGEYPEG FDKATRGLKK ARIQSYRGFV FVSLDVAGED DLVDFLGDAR
VFLDMLVAQS PSGELEVLPG TSTYTYEGNW KLQNENGLDG YHVSTVHYNY VATVQHRQQV
EAERGGAAAT LDYSKLGAGD AATDDGWFSF ANGHSVLFSE MPNPAVRPGY ASVMPRLVAE
YGQARAEWMM HRLRNLNLYP SLFVIDQISS QLRIVRPLAW NRTEIVSQCI GVKGESDADR
ENRIRQFEDF FNVSGMGTPD DLVEFREAQR GFQARLERWS DISRGHGKWL EGATPNSQAL
GIAPLLTGTE ITHEGLYVNQ HAHWRRFLLD GLERLALRAK EVTP