Gene YPK_2044 details


Gene Information

Locus tag: YPK_2044
Symbol: trpD
ID: 6087756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 2275498
End bp: 2276496
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 53%
IMG OID: 641597111
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001720784
Protein GI: 170024279
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
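
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `feature_length` and `protein_length` are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
gene_len = feature_length(2275498, 2276496)
print(gene_len)                  # 999
print(protein_length(gene_len))  # 332
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed above.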


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACATT TATTCGAAAA ACTGTTCCGG GCTGAGTCAA TGAGCCAAGA AGAAAGCCAG 
CAACTGTTTG CGGCGATTGT ACGTGGTGAA CTCGATCCAA GCCAACTGGC CGCAGTGCTA
ATCAGCATGA AAGTACGCGG GGAGACCCCA GCGGAGATTG CCGGGGCAGC TCAAGCTTTA
CTGGCAGATG CGCAACACTT TCCACGCCCA GACTACCTGT TTGCCGATAT TGTCGGGACC
GGCGGTGATG GCACCAACAG TATTAATATC TCCACCGCCA GTGCCTTTGT CGCGGCTAGC
TGTGGCGTAA AAGTGGCTAA ACATGGTAAC CGCAGTGTTT CTAGCCGTTC CGGTTCATCC
GATCTGCTGG CGGCATTTGG TATCCGTTTG GACATGAGTG CCGAGCAATC ACGGTTGGCG
TTGGACGATC TCGGGGTCTG CTTCCTGTTT GCGCCGCAAT ATCACACGGG TTTTCGTCAT
GCGATGCCAG TACGCCAACA GTTAAAGACC CGCACCCTGT TTAATGTGTT GGGGCCGTTG
ATCAACCCCG CCCGCCCGCC GCTGGCGCTC ATTGGCGTCT ATAGCCCTGA GTTAGTGTTA
CCGATCGCTC AAACGCTGAA AGTGCTGGGT TATCAACGCG CGGCAGTGGT ACATGGCGGT
GGAATGGATG AAGTGGCTAT TCATGCCCCG ACGCAGGTGG CTGAACTGAA TAACGGCAGT
ATTGAAAGCT ATCAATTGAC GCCAGAAGAT TTTGGTTTGA ATCGCTACCC GCTTGCCGCT
CTACAAGGCG GTATGCCGGA AGAAAACCGT GACATTTTAG CACGGTTGTT ACAAGGTAAA
GGTGAAACAG CACATGCGGC CGCCGTTGCT GCAAACGTCG CCTTGCTGCT GAAGTTATAC
GGCCAAGAAA ACCTGCGCCA TAATGCGCAA CAGGCATTGG AAATGATTCA CAGCGGTCAG
GCTTTTGATC GTGTTACTGC TCTGGCAGCG AGAGGATAA
 
Protein sequence
MQHLFEKLFR AESMSQEESQ QLFAAIVRGE LDPSQLAAVL ISMKVRGETP AEIAGAAQAL 
LADAQHFPRP DYLFADIVGT GGDGTNSINI STASAFVAAS CGVKVAKHGN RSVSSRSGSS
DLLAAFGIRL DMSAEQSRLA LDDLGVCFLF APQYHTGFRH AMPVRQQLKT RTLFNVLGPL
INPARPPLAL IGVYSPELVL PIAQTLKVLG YQRAAVVHGG GMDEVAIHAP TQVAELNNGS
IESYQLTPED FGLNRYPLAA LQGGMPEENR DILARLLQGK GETAHAAAVA ANVALLLKLY
GQENLRHNAQ QALEMIHSGQ AFDRVTALAA RG
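
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks need to be removed before any downstream use. The following is a minimal sketch, not part of the original record: `strip_blocks` and `gc_percent` are hypothetical helper names, and the GC calculation assumes a plain A/C/G/T sequence with no ambiguity codes.

```python
def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Last line of the gene sequence as displayed in the record.
last_gene_line = "GCTTTTGATC GTGTTACTGC TCTGGCAGCG AGAGGATAA"
tail = strip_blocks(last_gene_line)
print(len(tail))             # 39
print(tail.endswith("TAA"))  # True
```

Sixteen full lines of 60 bases plus this 39-base final line give 999 bp, matching the Gene Length field. Passing the full cleaned gene sequence to `gc_percent` would allow the GC content field to be checked in the same way.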