Gene YpsIP31758_1932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1932 
SymboltrpE 
ID5386011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2236052 
End bp2237617 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content50% 
IMG OID640864916 
Productanthranilate synthase component I 
Protein accessionYP_001400907 
Protein GI153947463 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.924364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCAAA CATCACGTCC TACTTTACAG TTACTCACCG CTAGCGCCTG TTACCGCGAT 
GACCCGACAG CGCTGTTTCA TCAATTATGT GGTGCCCGCC CCGCGACCCT ATTGCTTGAG
TCTGCAGAAG TTGACAACAA GCAGAATCTG AAAAGCCTGT TGGTCATCGA TAGCGCCTTG
CGTATCACGG CATTAGGGCA AACCGTCACT CTCGAAGCAT TGACCCGTAA TGGGGCTTCT
TTATTACCAC TGCTGGATGC AAGTTTGCCC ACTGAAGTCG ATATTCAGGT TCGTCCGAAT
GGCCGAGAGT TAACTTTCCC GTTAATAAAT GAAGTTCAGG ATGAGGATTC ACGCTTACAG
TCCTTATCTG TTTTTGATGC TTTACGTCAA TTATTAACAC TGGTTAATAC CCCACTTGGC
GAGCGTGAGG CCCTGTTTTT GGGCGGTTTG TTCGCTTACG ATTTAGTCGC TGGTTTTGAA
AATTTGCCTC CGTTGCGTCA GGATCAACGC TGCCCTGACT TCTGTTTCTA TTTAGCCGAG
ACATTGCTGG TTTTGGATCA TCAACATCGT TCAACTCGCT TGCAGGCCAG CCTGTTCACG
CCAGACAGTT CAGAGTATCA GCGCCTGGCG ACCCGCTTAG AACAACTCAG CCACCAGTTA
CAACAAGCGC CACACCCCAT CCCTGCAACC TCAGTCCCGG AGATGACGTT ACAGTGTAAC
CAATCAGATG AAGAGTATTG CAACGTTGTC AGTGAATTGC AGGTAGCAAT CCGTGAAGGT
GAGATTTTCC AGGTGGTCCC ATCCCGCCGT TTTACGCTGC CCTGCCCATC ACCGCTGGCG
GCCTATCAGA CACTGAAAGA CCATAATCCC AGCCCCTACA TGTTTTTCAT GCAAGACAAT
GATTTTTCTC TGTTTGGCGC ATCACCTGAA AGCGCACTGA AATACGATGC CAGCAACCGT
CAAATTGAGA TTTACCCGAT TGCGGGTACT CGTCCACGCG GTCGTCGTCC TAATGGAGAA
CTCGATCGTG ATTTAGACAG CCGTATCGAG TTGGAAATGC GTACTGACCA TAAAGAGATG
GCAGAACATT TAATGTTGGT GGATCTGGCT CGTAACGATC TGGCACGTAT TTGCGAACCC
GGTAGCCGCT ATGTTGCAGA TTTAACCAAA GTTGACCGTT ACTCTTTTGT CATGCATCTG
GTGTCCCGCG TGATTGGCAC TCTGCGTCAA GATTTAGATG TGCTGCATGC TTATCAAGCG
TGTATGAACA TGGGTACGCT AAGCGGTGCG CCCAAAGTAC GGGCCATGCA ATTGATTGCC
AGTAATGAAG GCTCACGCCG TGGCAGCTAC GGCGGTGCAG TGGGCTACTT CACTGCTCAC
GGTGATTTGG ATACCTGCAT TGTGATTCGT TCTGCCTACG TAGAGGACGG CATCGCCACC
GTACAAGCGG GTGCAGGGGT GGTTTTGGAC TCAGTTCCAC AAGCAGAAGC TGATGAAACT
CGGAATAAAG CCAGAGCCGT ACTGCGCGCC ATTGCCACTG CCCATCATGC CAAGGAGATT
TTCTAA
 
Protein sequence
MMQTSRPTLQ LLTASACYRD DPTALFHQLC GARPATLLLE SAEVDNKQNL KSLLVIDSAL 
RITALGQTVT LEALTRNGAS LLPLLDASLP TEVDIQVRPN GRELTFPLIN EVQDEDSRLQ
SLSVFDALRQ LLTLVNTPLG EREALFLGGL FAYDLVAGFE NLPPLRQDQR CPDFCFYLAE
TLLVLDHQHR STRLQASLFT PDSSEYQRLA TRLEQLSHQL QQAPHPIPAT SVPEMTLQCN
QSDEEYCNVV SELQVAIREG EIFQVVPSRR FTLPCPSPLA AYQTLKDHNP SPYMFFMQDN
DFSLFGASPE SALKYDASNR QIEIYPIAGT RPRGRRPNGE LDRDLDSRIE LEMRTDHKEM
AEHLMLVDLA RNDLARICEP GSRYVADLTK VDRYSFVMHL VSRVIGTLRQ DLDVLHAYQA
CMNMGTLSGA PKVRAMQLIA SNEGSRRGSY GGAVGYFTAH GDLDTCIVIR SAYVEDGIAT
VQAGAGVVLD SVPQAEADET RNKARAVLRA IATAHHAKEI F