Gene YpAngola_A0808 details

Gene Information

Locus tag: YpAngola_A0808
Symbol: thrA
ID: 5799270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 826777
End bp: 829236
Gene Length: 2460 bp
Protein Length: 819 aa
Translation table: 11
GC content: 50%
IMG OID: 641338805
Product: bifunctional aspartokinase I/homeserine dehydrogenase I
Protein accession: YP_001605383
Protein GI: 162418682
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase; [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00657] aspartate kinase


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.693104
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGTGC TGAAATTTGG CGGGACATCA GTGGCAAATG CCGAGCGCTT TATGCGTGTT 
GCCGATATCA TCGAAAGTAA TGCGCGTCAG GGACAAGTGG CTACGGTGTT ATCCGCTCCC
GCGAAGATAA CGAACCATCT CGTTGCCATG ATCGATAAAA TGGTCGCAGG GCAAGATATC
TCCCCGAACA TCAGCGATGC TGAGCGGATT TTTGCTGAGT TATTGCGTGG ATTAGCTGAT
ACTCAGCCAG GCTTTGATTA TGATCGCCTA AAAGCCTTGG TCGGCCATGA GTTTGCGCAA
CTCAAACACC TGCTACACGG TATTTCGTTG TTGGGGCAGT GCCCAGACAG CATTAATGCC
TCAATTATCT GCCGTGGCGA AAAACTCTCC ATCGCCATTA TGGAGGCGTT ATTTCAGGCT
AAGGGTTATC ACGTTACGGT GATTAATCCA GTCGAAAAAT TGTTGGCGCA GGGCCATTAT
CTCGAATCCA CTGTTGATAT CACGGAGTCT ACCCGCCGTA TCGGTGCCAG CGGTATTCCA
TCTGATCACA TCATCCTGAT GGCTGGTTTT ACTGCGGGTA ATGATAAAGG TGAATTGGTG
GTGTTGGGGC GTAATGGCTC TGATTATTCC GCTGCCGTGC TGGCTGCTTG CTTACGGGCA
GATTGTTGTG AGATTTGGAC CGATGTCGAC GGGGTCTATA CTTGTGACCC GCGTACCGTT
CCAGATGCCA GATTACTGAA ATCGATGTCA TACCAAGAGG CAATGGAGCT TTCCTATTTT
GGTGCTAAAG TTCTTCACCC TCGCACTATC GCTCCTATTG CCCGCTTCCA AATTCCTTGT
CTGATAAAAA ACACCTCTAA CCCGCAAGCC CCCGGTACCT TGATTGGTGG TGAAAGCATT
GATGAGGATT CTCCGGTCAA AGGCATTACC AACCTGAATA ATATGGCGAT GATTAATGTT
TCAGGGCCAG GAATGAAAGG GATGGTCGGT ATGGCTGCCC GCGTCTTTGC GGTGATGTCG
CGCAGCGGTA TTTCAGTGGT GCTAATCACT CAGTCCTCTT CTGAATACAG CATCAGTTTT
TGTGTGCCGC AAGGTGAGTT ATTGCGCGCC CGCCGAGCGC TGGAAGATGA GTTTTATCTG
GAGTTGAAAG ACGGTGTCTT GGATCCTCTG GATGTGATGG AACATCTGGC AATTATCTCT
GTGGTCGGTG ACGGTATGCG CACCCTGCGC GGTATTTCTG CCCGTTTCTT CTCGGCTCTG
GCACGCGCCA ACATCAATAT TATTGCTATC GCTCAAGGCT CGTCTGAGCG CTCTATTTCT
GTGGTTGTCA ATAATGATGC GGTCACTACC GGGGTGCGAG TTTGCCACCA GATGCTGTTT
AACACGGATC AAGTCATTGA GGTGTTTGTC ATTGGCGTGG GTGGTGTGGG GGGCGCATTA
ATCGAACAAA TCTATCGCCA GCAACCGTGG TTGAAACAGC GTCATATTGA TTTACGTGTT
TGTGGTATCG CGAATTCCAA AGCCATGTTG ACCAACGTGC ATGGCATTGC ATTAGATAAC
TGGCGTCAGG AACTGGCTGA AGTTCAGGAG CCGTTTAACC TGAGCCGCTT GATCCGTCTG
GTTAAAGAGT ATCACTTACT GAACCCGGTC ATTGTTGACT GTACCTCAAG CCAGGCGGTT
GCTGATCAAT ATGCTGATTT CCTGACCGAT GGCTTCCATG TGGTGACGCC AAACAAGAAA
GCCAACACTT CATCAATGAA TTATTACCGC CAAATGCGTG CCGCTGCGAC GAAATCGTGC
CGTAAATTCT TGTATGACAC AAACGTTGGG GCGGGGCTAC CGGTGATCGA GAACTTACAG
AACCTGCTCA ATGCGGGTGA TGAACTGATG CGCTTCACCG GTATTCTGTC CGGTTCACTT
TCCTTCATCT TCGGTAAGCT AGATGAGGGG ATGTCATTGT CAGAGGCAAC ACGGCAAGCT
AAGGCGTTAG GGTATACCGA ACCCGATCCA CGTGATGATC TCTCTGGGAT GGATGTTGCC
CGTAAATTGT TGATTTTGGC CCGTGAAGCA GGGTACAAAC TTGAGTTGGC TGATATCGAG
GTTGAGTCCG TCTTACCAGC AAGCTTTGAT GCTTCAGGTG ATGTGGATAC TTTCCTGGCG
CGCCTGCCAT CATTGGATGC TGAATTTACC CGCTTGGTAG CGAATGCTGC GGAGCAGGGC
AAAGTGCTAC GTTATGTCGG AGTGATTGAA GACGGGCGTT GTAAAGTGCG GATGGAGGCA
GTTGACGGTA ACGACCCGTT GTATAAAGTT AAGAATGGCG AGAATGCGTT GGCTTTCTAT
ACCCGCTATT ATCAGCCAAT TCCGTTGGTG CTACGCGGTT ATGGCGCAGG CAATGATGTG
ACTGCCGCTG GGGTGTTTGC CGATCTTTTA CGCACATTAT CATGGAAGTT GGGAGTTTAA
 
Protein sequence
MRVLKFGGTS VANAERFMRV ADIIESNARQ GQVATVLSAP AKITNHLVAM IDKMVAGQDI 
SPNISDAERI FAELLRGLAD TQPGFDYDRL KALVGHEFAQ LKHLLHGISL LGQCPDSINA
SIICRGEKLS IAIMEALFQA KGYHVTVINP VEKLLAQGHY LESTVDITES TRRIGASGIP
SDHIILMAGF TAGNDKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRTV
PDARLLKSMS YQEAMELSYF GAKVLHPRTI APIARFQIPC LIKNTSNPQA PGTLIGGESI
DEDSPVKGIT NLNNMAMINV SGPGMKGMVG MAARVFAVMS RSGISVVLIT QSSSEYSISF
CVPQGELLRA RRALEDEFYL ELKDGVLDPL DVMEHLAIIS VVGDGMRTLR GISARFFSAL
ARANINIIAI AQGSSERSIS VVVNNDAVTT GVRVCHQMLF NTDQVIEVFV IGVGGVGGAL
IEQIYRQQPW LKQRHIDLRV CGIANSKAML TNVHGIALDN WRQELAEVQE PFNLSRLIRL
VKEYHLLNPV IVDCTSSQAV ADQYADFLTD GFHVVTPNKK ANTSSMNYYR QMRAAATKSC
RKFLYDTNVG AGLPVIENLQ NLLNAGDELM RFTGILSGSL SFIFGKLDEG MSLSEATRQA
KALGYTEPDP RDDLSGMDVA RKLLILAREA GYKLELADIE VESVLPASFD ASGDVDTFLA
RLPSLDAEFT RLVANAAEQG KVLRYVGVIE DGRCKVRMEA VDGNDPLYKV KNGENALAFY
TRYYQPIPLV LRGYGAGNDV TAAGVFADLL RTLSWKLGV