Gene YpAngola_A3171 details

Gene Information

Locus tag: YpAngola_A3171
Symbol: hisC
ID: 5801646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3356505
End bp: 3357653
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 52%
IMG OID: 641341004
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001607531
Protein GI: 162420699
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
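
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3356505
end_bp = 3357653
protein_length_aa = 382

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# One codon (3 bp) per residue, plus one stop codon that is not translated.
expected_gene_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)           # 1149
print(expected_gene_length_bp)  # 1149
```

Both figures agree with the listed Gene Length of 1149 bp.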


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.777037
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAT CTAATAACGT CACGGACCTG GCCCGTGCCA ACATCCGTGC TCTGACTCCC 
TATATGTCCG CACGTCGGTT AGGCGGTAAT GGCGATGTCT GGCTGAATGC CAACGAATAT
CCGCTGGGCA CTGAATATCA GTTGACCACG CAAACCTTCA ATCGCTATCC CGAGTGTCAG
CCTAAGCACG TTATTGAGCG CTATGCCGCT TACGCCGGTT TACCGCCAGA GCAAGTACTG
GTTAGTCGTG GTGCTGATGA AGGGATCGAA CTGCTGATCC GCGCGTTCTG TGAGCCGGGT
CAGGATGCCA TTTTATTCTG CCCACCAACC TACGGCATGT ACGCTGTCAG TGCTGAAACC
TTTGGTGTAG AACGGCGCAC CGTACCCGCT CAGGCTGACT GGCAGTTAGA TTTACCGGCC
ATTGCCAACA ATCTGGAACA GGTAAAAGTG ATCTATGTTT GCAGCCCAAA TAACCCGACG
GGTAATTTAA TCAACCCGGC TGATTTACAG GCGGTGCTGG CACTGGCGCA AGGCCGCGCG
ATTGTCGCCA TCGACGAAGC CTATATTGAG TTTTGTCCAC AAGCATCGGT CAGTAATTGG
CTAAAAGATT ATCCGAATTT AGTGATTTTG CGCACCTTAT CGAAAGCCTT TGCATTAGCG
GGTTTACGTT GTGGCTTTAC GTTAGCCAAC AGCGATATCA TCCAATTGCT GCTTAAAGTG
ATCGCCCCCT ATCCGTTATC TACGCCAGTG GCGGATATTG CCGCGCAAGC ACTCAGCCCA
AAGGGGATTG AGCAAATGCG CCAACGGGTC AGTGAAGTAC GAGCTAACCG CGCATGGCTA
CAATCCGCAC TGCAAGATTG CGCCTGTGTC GAACAGGTGT TCACCAGCGA AAGCAACTAT
TTGCTGGCCC GCTTTACCGC GTCCAGCAGC GTATTCAACG CATTGTGGGA TCAGGGCATT
ATTTTGCGTG ATCAAAATAA ACAACCGGGG TTAGCCAACT GCCTGCGCAT CACCATTGGC
ACCCGTCAGG AGTGTGAGCG AGTGATTGCC GCCCTTGCCC CCCTGCCCGG CATTGATAAC
TCAAATAACA TTGATAACCA GAATAAAACC TATTCTCAGA CCTCCAGCAT CCGTAAGGGA
ACGATATGA
 
Protein sequence
MSQSNNVTDL ARANIRALTP YMSARRLGGN GDVWLNANEY PLGTEYQLTT QTFNRYPECQ 
PKHVIERYAA YAGLPPEQVL VSRGADEGIE LLIRAFCEPG QDAILFCPPT YGMYAVSAET
FGVERRTVPA QADWQLDLPA IANNLEQVKV IYVCSPNNPT GNLINPADLQ AVLALAQGRA
IVAIDEAYIE FCPQASVSNW LKDYPNLVIL RTLSKAFALA GLRCGFTLAN SDIIQLLLKV
IAPYPLSTPV ADIAAQALSP KGIEQMRQRV SEVRANRAWL QSALQDCACV EQVFTSESNY
LLARFTASSS VFNALWDQGI ILRDQNKQPG LANCLRITIG TRQECERVIA ALAPLPGIDN
SNNIDNQNKT YSQTSSIRKG TI
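
The relationship between the two sequences above can be sketched in a few lines of Python. This is an illustrative example, not the database's own method: it assumes the gene sequence is given in coding orientation, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCCAAT CTAATAACGT CACGGACCTG"))  # MSQSNNVTDL
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("ACGATATGA"))  # TI
```

The two outputs match the first ten and last two residues of the protein sequence listed above.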