Gene Pmen_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPmen_3997 
Symbol 
ID5110220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas mendocina ymp 
KingdomBacteria 
Replicon accessionNC_009439 
Strand
Start bp4369858 
End bp4371348 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content65% 
IMG OID640505260 
Productanthranilate synthase component I 
Protein accessionYP_001189476 
Protein GI146309011 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.346347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.520041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATG AAGAATTCCT GCGTTTAGCC GCCGAAGGCT ACAACCGCAT TCCGCTGGCC 
TGCGAAACCC TGGTGGATTT CGACACGCCG CTGTCCATCT ACCTGAAACT GGCCGATGCG
CCCAACTCCT ACCTGCTCGA ATCCGTGCAG GGCGGCGAAA AGTGGGGCCG TTATTCGATC
ATCGGCCTGC CGGCGCGCAC CGTGCTGCGC GTGCATGGCC ATGATGTCGT GGTCAGCACC
GACGGCGTCG AGGTCGAGCG CCACGAGTGC GCCGACCCGC TGGAGTTCGT CGAGCAGTTC
AAGGCGCGCT ACAAGGTGCC GACCATCGCC GGTCTGCCGC GCTTCAACGG TGGCCTGGTG
GGCTACTTCG GCTATGACAG CGTGCGTTAC GTCGAGCCCA AGCTGGCTGC CGGGGTGAAT
CCCGATGCGC TGGGCACGCC GGACATCCTG CTGATGGTCT CCGACGCGGT GGTGGTGTTC
GACAACCTGG CCGGCAAGAT GCACGGCATC GTTCTGGCCG ACCCGGCGCA GCCGGATGCC
TTCGAGCAAG GCCATGTGCG TCTGCGCGAG ATCCTGCACA AGCTGCGCCA GCCTATCACC
CCACGCCTGG GCGTGGACCT GTCTGCGCCT TTGGGCGAGG AGCCGGCGTT TCGCTCCAGC
TACAGCCAGG GCGACTACGA GGCGGCGGTC CGTTCGATCA AGGAGTACAT CCTGGCCGGC
GACTGCATGC AGGTGGTGAT CTCCCAGCGC ATGTCGATTC CGTTCAAGGC CGCGCCCATC
GACCTGTACC GCGCGCTGCG CTGCATCAAC CCGACGCCCT ACATGTACTT CTTCAACTTC
GGCGACTTTC ATGTGGTCGG CTCCTCGCCC GAGGTGCTGG TGCGCGTCGA GGACGGCCTG
GTCACCGTGC GCCCCATCGC CGGTACGCGT CCGCGCGGCG CCAGCGAGGA GGCGGACAAG
GCGCTGGAAG ACGACCTGCT GTCCGACACC AAGGAAATCG CCGAGCACCT GATGCTGATC
GACCTGGGCC GTAACGACAC CGGCCGCGTG TCGCAGATCG GCTCGGTCAA GCTGACCGAG
AAGATGGTCA TCGAGCGCTA TTCCAACGTC ATGCATATCG TCTCCAACGT CACCGGTCAG
CTGAAACCCG AGCTTTCGGC GATGGACGCG CTGCGCGCCA TCCTGCCGGC CGGCACCTTG
TCCGGCGCGC CGAAGATCCG CGCCATGGAA ATCATCGATG AGCTGGAGCC GGTCAAGCGC
GGCGTCTACG GCGGCGCAGT CGGCTACTAC GCCTGGAACG GCAACATGGA CACCGCCATC
GCCATTCGTA CCGCGGTGAT CAAAGACGGC GAGCTGCACG TACAGGCCGG CGCCGGCATC
GTCGCCGACT CGGTGCCCGC GCTGGAGTGG GAAGAAACCC TGAACAAGCG CCGCGCCATG
TTCCGCGCCG TCGCGCTCGC CGAACAGACT GCCGTCCAGG CCCAGGAATA A
 
Protein sequence
MTHEEFLRLA AEGYNRIPLA CETLVDFDTP LSIYLKLADA PNSYLLESVQ GGEKWGRYSI 
IGLPARTVLR VHGHDVVVST DGVEVERHEC ADPLEFVEQF KARYKVPTIA GLPRFNGGLV
GYFGYDSVRY VEPKLAAGVN PDALGTPDIL LMVSDAVVVF DNLAGKMHGI VLADPAQPDA
FEQGHVRLRE ILHKLRQPIT PRLGVDLSAP LGEEPAFRSS YSQGDYEAAV RSIKEYILAG
DCMQVVISQR MSIPFKAAPI DLYRALRCIN PTPYMYFFNF GDFHVVGSSP EVLVRVEDGL
VTVRPIAGTR PRGASEEADK ALEDDLLSDT KEIAEHLMLI DLGRNDTGRV SQIGSVKLTE
KMVIERYSNV MHIVSNVTGQ LKPELSAMDA LRAILPAGTL SGAPKIRAME IIDELEPVKR
GVYGGAVGYY AWNGNMDTAI AIRTAVIKDG ELHVQAGAGI VADSVPALEW EETLNKRRAM
FRAVALAEQT AVQAQE