Gene YpAngola_A3332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3332 
Symbol 
ID5801809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3546863 
End bp3548023 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content53% 
IMG OID641341153 
Productputative aminotransferase 
Protein accessionYP_001607675 
Protein GI162421729 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.816568 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTT TATCTTTTAT TCCAGACAGT AAATTGCCTG CGCAAGGCAC CACTATCTTC 
ACGCAAATGA GTGCATTGGC ACAAAAGCAC CAAGCGATCA ATTTGTCACA GGGCTTCCCT
GATTTTGATG GGCCGGATTA CCTGAAGCAG CGGCTAGCCT ATCATGTCGA CCAAGGCGCA
AACCAATATG CGCCGATGAT CGGCGTCGCA CCACTGCGTC ATGCTATCGC GGAGAAAACG
GCTAATCTCT ACGGGTGGCG GCCCGATGCC GAGCACGAAG TGACCGTGAC CACCGGGGCC
AGCGAAGCAC TATTTGCTGC CATCACCGCC CTCGTTCGTC CTGGTGATGA AGTGATCTGC
CTTGACCCCA GCTACGACAG CTATGCACCG GCAGTCAAAC TGGCGGGTGG CGTCCTCAAG
CGGATCACAC TAAAACCCCC TGCTTTTACC ACTGATTGGG CTGAATTTAC CCGTTTGGTC
TCTGAACGCA CCCGTCTCGT TATCGTTTAT ACCCCCCATA ACCCGTCGGC TACCGTTTGG
TGTGCAGAAG ATTTTGAACA GCTTTGGCAG GTCATTGCAG AACGCAATAT TTATGTTTTG
AGTGATGAAG TTTACGAGCA CATCTGCTTT AGCCGTTCAG GTCATGCCAG TGTGTTGGCC
CATCCGCAAC TGCGTCAGCG AGCGATTGCC GTTTCTTCGT TCGGCAAAAC CTTTCATATG
ACGGGCTGGA AAGTGGGTTA TTGCATCGCA CCCGCCGCCA TCAGCGCCGA AGTGCGCAAA
ATTCACCAAT ACCTGACCTT CTCCGTCTGC ACACCGGTCC AACTGGCATT GGCAGATATG
CTTAATGCCG AGCCAGAACA CTGGCAGCAG TTGCCTGAAT TTTACCGTGC CCGCCGCGAT
CGTTTCGTCA AGGCACTGGC AGCCAGTCGC CTGAAAATTC TGCCAAGCGA GGGGACCTAT
TTCCTGTTGG CGGATTACAG CGGCATTTCA GATCTTGATG ATGTTGAGTT CTGTCAATGG
CTCACCGAGC ACGTGGGCGT TGCTGCGATA CCGTTATCGG TCTTTTGTGA AGCTCCGTTC
CCCCATAAAT TGATCCGGCT GTGCTTCGCC AAACAAGATG CCACGCTGGA CGCCGCCGCA
GAGAGATTAT GTCAACTTTA A
 
Protein sequence
MSTLSFIPDS KLPAQGTTIF TQMSALAQKH QAINLSQGFP DFDGPDYLKQ RLAYHVDQGA 
NQYAPMIGVA PLRHAIAEKT ANLYGWRPDA EHEVTVTTGA SEALFAAITA LVRPGDEVIC
LDPSYDSYAP AVKLAGGVLK RITLKPPAFT TDWAEFTRLV SERTRLVIVY TPHNPSATVW
CAEDFEQLWQ VIAERNIYVL SDEVYEHICF SRSGHASVLA HPQLRQRAIA VSSFGKTFHM
TGWKVGYCIA PAAISAEVRK IHQYLTFSVC TPVQLALADM LNAEPEHWQQ LPEFYRARRD
RFVKALAASR LKILPSEGTY FLLADYSGIS DLDDVEFCQW LTEHVGVAAI PLSVFCEAPF
PHKLIRLCFA KQDATLDAAA ERLCQL