Gene YpAngola_A2312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2312 
SymboltrpE 
ID5800782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2427571 
End bp2429136 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content51% 
IMG OID641340202 
Productanthranilate synthase component I 
Protein accessionYP_001606747 
Protein GI162420907 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAA CATCACGTCC TACTTTACAG TTACTCACCG CTAGCGCCTG TTACCGCGAT 
GACCCGACAG CGCTGTTTCA TCAATTATGT GGTGCCCGCC CCGCGACCCT GTTGCTTGAG
TCTGCAGAAG TTGACAACAA ACAGAATCTG AAAAGCCTGT TGGTCATCGA CAGTGCCTTG
CGTATCACGG CATTAGGGCA AACCGTCACT CTCGAAGCAT TGACCCGTAA TGGGGCTTCT
TTATTACCAC TGCTGGATGC CAGTCTGCCC ACTGAAGTCG ATATTCAGGT TCGTCCGAAT
GGCCGAGAGT TAACTTTCCC GTTAATAAAT GAAGTCCAGG ATGAGGATTC ACGCTTACAG
TCCTTATCTG TTTTTGATGC TTTACGTCAA TTATTAACAC TGGTTAATAC CCCACTTGCC
GAGCGTGAGG CCCTGTTTTT GGGCGGTTTG TTCGCTTACG ATTTAGTCGC TGGTTTTGAA
AATTTGCCTC CGTTGCGTCA GGATCAACGC TGCCCTGACT TCTGTTTCTA TTTAGCCGAG
ACATTGCTGG TTTTGGATCA TCAGCATCGT TCAACTCGCT TGCAGGCCAG CCTGTTCACG
CCAGACAGTT CAGAGTATCA GCGCCTGGCG ACCCGTTTAG AGCAACTCAG CCACCAGTTA
CAACAAGCGC CACACCCCAT TCCTGCCACC TCCGTGCCAG AGATGGCGTT ACAGTGTAAC
CAATCAGATG AAGAGTATTG CAACGTTGTC AGTGAATTGC AGGTAGCAAT CCGTGAAGGT
GAGATTTTCC AGGTGGTCCC ATCCCGCCGT TTTACGCTGC CCTGCCCATC ACCGCTGGCG
GCCTATCAGA CACTGAAAGA CCATAATCCC AGCCCCTACA TGTTTTTCAT GCAAGACAAT
GATTTTTCTC TGTTTGGTGC ATCACCTGAG AGCGCACTGA AATACGATGC CAGCAACCGT
CAAATTGAGA TTTACCCGAT TGCGGGTACT CGTCCACGCG GTCGTCGTCC TAATGGAGAA
CTCGATCGTG ATTTAGACAG CCGTATCGAG TTGGAAATGC GTACTGACCA TAAAGAGATG
GCAGAACATT TAATGTTGGT GGATCTGGCT CGTAACGATC TAGCGCGTAT TTGCGAACCC
GGTAGCCGCT ATGTTGCAGA TTTAACCAAA GTTGACCGTT ACTCCTTTGT CATGCATCTG
GTGTCCCGCG TGATTGGCAC TCTGCGTCAA GATTTAGATG TGCTGCATGC TTATCAAGCG
TGTATGAACA TGGGTACGCT AAGCGGTGCG CCCAAAGTAC GGGCCATGCA ATTGATTGCC
AGTAATGAAG GCTCACGCCG TGGCAGCTAC GGCGGTGCAG TGGGCTACTT CACTGCTCAC
GGTGATTTGG ATACCTGCAT TGTGATTCGT TCTGCCTACG TAGAGGACGG CATCGCCACC
GTACAAGCGG GTGCAGGGGT GGTTTTGGAC TCAGTTCCAC AAGCAGAAGC TGATGAAACT
CGGAATAAAG CCAGAGCCGT ACTGCGCGCC ATTGCCACTG CCCATCATGC CAAGGAGATT
TTCTAA
 
Protein sequence
MMQTSRPTLQ LLTASACYRD DPTALFHQLC GARPATLLLE SAEVDNKQNL KSLLVIDSAL 
RITALGQTVT LEALTRNGAS LLPLLDASLP TEVDIQVRPN GRELTFPLIN EVQDEDSRLQ
SLSVFDALRQ LLTLVNTPLA EREALFLGGL FAYDLVAGFE NLPPLRQDQR CPDFCFYLAE
TLLVLDHQHR STRLQASLFT PDSSEYQRLA TRLEQLSHQL QQAPHPIPAT SVPEMALQCN
QSDEEYCNVV SELQVAIREG EIFQVVPSRR FTLPCPSPLA AYQTLKDHNP SPYMFFMQDN
DFSLFGASPE SALKYDASNR QIEIYPIAGT RPRGRRPNGE LDRDLDSRIE LEMRTDHKEM
AEHLMLVDLA RNDLARICEP GSRYVADLTK VDRYSFVMHL VSRVIGTLRQ DLDVLHAYQA
CMNMGTLSGA PKVRAMQLIA SNEGSRRGSY GGAVGYFTAH GDLDTCIVIR SAYVEDGIAT
VQAGAGVVLD SVPQAEADET RNKARAVLRA IATAHHAKEI F