Gene YpAngola_A1976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1976 
Symbol 
ID5800447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2058770 
End bp2059960 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content49% 
IMG OID641339900 
Productaromatic amino acid aminotransferase 
Protein accessionYP_001606450 
Protein GI162421411 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1448] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0279387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.804208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAAA AAATTACTGC TGCACCTGCG GATCCTATTT TAGGTTTGAC CGATATCTTC 
CGCGCGGATG ACCGCGCTCA TAAAATCAAT CTGGGGATCG GTGTCTACAA AGACGAAACC
GGTAAAACCC CAGTGCTGAC CAGCGTGAAG AAAGCTGAGC AGTATTTACT GGAAAATGAA
GCTACCAAAA ATTATCTGGG TATTGACGGC CTGCCTGTTT TTGCCAGTTG CACCCAAGAA
CTGCTGTTCG GTGCTAATAG TGCAATTATT GCGGATAAAC GTGCTCGTAC CGCTCAAACG
CCAGGTGGTA CCGGTGGTTT GCGCATTGCC GCTGATTTTA TCGCCCATCA GACCAGCGCT
AAACGTGTCT GGGTCAGTAA CCCAAGCTGG CCAAACCATA AAAACGTCTT CGAAGCCGCA
GGGCTGGAAG TGGTGGAGTA CGCTTATTAT GATGCTGCTA ACCATGCATT GGACTTCGAT
GGCTTGTTAA ATAGCCTGTC AGAAGCTCAG GCGGGTGATG TGGTGTTGTT CCACGGCTGC
TGCCATAACC CAACGGGTAT CGACCCAACA GAAACTCAGT GGAGCCAACT GGCGGAGTTA
TCGGTTGCTA AAGGCTGGTT GCCACTGTTT GATTTCGCTT ATCAAGGCTT TGCCAACGGT
TTGGAAGAAG ATGCTCAGGG CCTGCGTATT TTCGCGGCAA CACATCAAGA GTTGATCGTT
TGCAGCTCTT ATTCGAAAAA CTTTGGTTTG TACAATGAAC GTGTTGGGGC TTGTACCCTT
GTCGCCGCCG ACAGTAACGT TGCCGATACC GCATTCAGCC AAGTTAAAGC GGTTATCCGA
GCCAACTACT CTAACCCACC GGCACATGGT GCATCAGTCG TTGCCACTAT TCTGAGTAAT
GCCGCACTAC GGGCGATTTG GGAACAAGAA CTGACCGATA TGCGTCAGCG CATCCAACGT
ATGCGTCAGT TGTTTGTTAA TACCTTGCAG GAAAAAGGCG CTCAACAAGA TTTTAGCTTT
ATCATTAACC AGAATGGGAT GTTCTCGTTC AGTGGTCTGA CCAAAGAACA AGTGCTGCGC
CTGCGTGATG AGTTCGCCGT CTATGCGGTA AATTCTGGCC GGGTTAACGT GGCGGGGATG
ACACCAGACA ATATGGCGCC GTTATGTGAA GCTATCGTTG CGGTGCTCTA A
 
Protein sequence
MFEKITAAPA DPILGLTDIF RADDRAHKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE 
ATKNYLGIDG LPVFASCTQE LLFGANSAII ADKRARTAQT PGGTGGLRIA ADFIAHQTSA
KRVWVSNPSW PNHKNVFEAA GLEVVEYAYY DAANHALDFD GLLNSLSEAQ AGDVVLFHGC
CHNPTGIDPT ETQWSQLAEL SVAKGWLPLF DFAYQGFANG LEEDAQGLRI FAATHQELIV
CSSYSKNFGL YNERVGACTL VAADSNVADT AFSQVKAVIR ANYSNPPAHG ASVVATILSN
AALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGAQQDFSF IINQNGMFSF SGLTKEQVLR
LRDEFAVYAV NSGRVNVAGM TPDNMAPLCE AIVAVL