Gene YpAngola_A2521 details

Gene Information

Locus tag: YpAngola_A2521
Symbol:
ID: 5800991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2640251
End bp: 2641552
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 52%
IMG OID: 641340391
Product: aminotransferase
Protein accession: YP_001606934
Protein GI: 162421035
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
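
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2640251
end_bp = 2641552
gene_length_bp = 1302
protein_length_aa = 433

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codons = span_bp // 3
translated_aa = codons - 1

print(span_bp, span_bp % 3, translated_aa)  # prints: 1302 0 433
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and yields the listed protein length.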


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTGG TGCAATCGTA TCAGTTGTTG GAAAGCCAGG GGTGGATCGT TGCCCGCCCA 
CAATCGGGCT ACTATGTGGC CTCACGGCCC ACGCAGTTAC CGCAACCACA GCGGGGTAAT
AAGCTCCATC TGGATGAACA GGTTGATATC AATACCTTTA TTTTTAATGT GTTACAGGCA
GGGAAAGATC CGGCAATCAT GCCGTTCGGT TCAGCATTTC CAGATCCGAG TTTATTGTTA
CAACCGCGTT TGTCGCGGGC GTTGGCCAGC GTCGCTCGCA AGATCTCGCC ACAAAGTTCT
GTAACGAATT TGCCACCGGG TAATGAAAGT TTACGGCGCA ATATTGCGCA ACGTTATGCT
GCCTGCGGGA TGAATGTATC ACCCGAGGAA ATTGTGATTA CGGCGGGGGC GATGGAGTCA
CTGAGCCTTA GCTTACAAGC CGTGACACAG CCCGGTGATT GGGTCGTTAT TGAATCGCCC
GCGTTTTATG GTGCATTACA GGCGATTGAA CGCCTACGAC TTAAGGCTAT TGCCATTACG
ACGGATCCCC AGCAAGGTGT GGATTTGGAG GCCCTTGAGC AGGTTATCAG CCAGTATCCG
ATAAAAGCGT GCTGGTTAAT GCCCCATTTT CAAAACCCAA TGGGGGCCAC CATGCCGTGG
TCACAGAAAC AGCGACTCGT CACCCTGCTA CAGCAGAAGG CTATTTCACT GATTGAAGAT
GATGTATATG GCGAACTGTA TTTTGGTGCT GAACGCTTGC TACCGGCAAA AGCGCTTGAC
CAGCATCGGC AGATTTTACA TTGCTCCTCT TTTTCTAAAT GCCTGGCGCC GGGGTTCCGT
GTGGGGTGGG TGGCAGCAGG TGAACATGCT CAGCGTATTC AACATTTACA GTTGATGAGC
ACCGTTTCGG CCAGCGTCCC GACCCAACTG GCGATTGCGG ATTATCTGAG CCAAGGGGGA
TACGATACTC ACCTGCGCCG CTTGCGTCGG GTAATGGAAC AGCGGATGAG CACGTTGCAT
CAGGCCGTTG TTGAACACTT CCCCAAAAAC ATCAAAATCA GCCACCCCGC TGGCGGCTAT
TTCTTATGGT TAGAATTGGA GCCACCTTTT AATGCCAGTG AACTGTACCG ACGGGCACTG
GAGCAGGGGG TTAGCATTGC TCCTGGGAGA ATGTTCACCA CAGGGAGTCA GTTTGATCAT
TGCTTTCGCC TTAATGCCTC CTTTGCCTGG TCAGAACAAA GCGCAAAAGC GATTCGTATT
TTGGCGAAGT TAATCGGCCA ATTGACGAAC GAACGTCGAT AG
 
Protein sequence
MTVVQSYQLL ESQGWIVARP QSGYYVASRP TQLPQPQRGN KLHLDEQVDI NTFIFNVLQA 
GKDPAIMPFG SAFPDPSLLL QPRLSRALAS VARKISPQSS VTNLPPGNES LRRNIAQRYA
ACGMNVSPEE IVITAGAMES LSLSLQAVTQ PGDWVVIESP AFYGALQAIE RLRLKAIAIT
TDPQQGVDLE ALEQVISQYP IKACWLMPHF QNPMGATMPW SQKQRLVTLL QQKAISLIED
DVYGELYFGA ERLLPAKALD QHRQILHCSS FSKCLAPGFR VGWVAAGEHA QRIQHLQLMS
TVSASVPTQL AIADYLSQGG YDTHLRRLRR VMEQRMSTLH QAVVEHFPKN IKISHPAGGY
FLWLELEPPF NASELYRRAL EQGVSIAPGR MFTTGSQFDH CFRLNASFAW SEQSAKAIRI
LAKLIGQLTN ERR
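
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before any computation. The sketch below is one way to do that and to translate or measure GC content in plain Python. The helper names (clean, gc_fraction, translate) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table built in TCAG order. Assumption: table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, copied with their grouping spaces.
sample = clean("ATGACCGTGG TGCAATCGTA TCAGTTGTTG")
print(translate(sample))  # prints: MTVVQSYQLL
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full cleaned gene sequence to gc_fraction gives a value to compare with the listed GC content.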