Gene YpAngola_A2512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2512 
SymbolhutI 
ID5800982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2629278 
End bp2630498 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content50% 
IMG OID641340382 
Productimidazolonepropionase 
Protein accessionYP_001606925 
Protein GI162419282 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0955381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATCAG TAACTCACTG TGACAGCTTA TGGTTCGGGG CCGATATCAT TACGATGCGC 
GGGGGAAATT ATCAGTTGAT TCCGCAAGGG GCAATCGCTG TCACTGGCGA TAAGATAGTC
TGGATTGGGC CACATGCCGA ATTACCGCCT ATTCATGCCG CACGTCAGGT CGTATATGAA
GGTGGTCTTA TCACCCCTGG ATTGATTGAC TGTCACACCC ATCTCGTGTT TGGCGATGAT
CGTAGCAATG AATTTGAGCA ACGCCTTAAC GGGGTCAGCT ATGCCGAAAT TGCTGCTAAT
GGCGGTGGTA TTATTTCAAC CGTCAGAGCC ACACGCCAAG CTAGCGAACA GCAACTACTG
GAACAAGCCC TATTTCGTCT GAAGCCCTTA CTTGCTGAAG GGGTGACTAC GATTGAGATT
AAGTCTGGCT ATGGCCTTAA TCTTGAAAGT GAAATAAAAA TGTTGCGAGT GGCCCGCCGA
TTGGGGGAGT TACTGCCTAT TGACGTCAAA ACGACTTGTT TGGCCGCCCA TGCGCTACCG
CCCGAGTTTA TCGGGCAGCC TGATGATTAT ATTGATGTCG TATGTAATAG CATTATTCCT
CAGGTGGCAG TTGAAAACTT AGCCGATGCC GTGGACGCAT TTTGCGAACA TTTAGCTTTT
TCACCGGCTC AAGTTGAGCG AGTATTTTTA GCCGCACAAA AAGCCGGGCT ACCTGTAAAA
CTGCACGCAG AGCAACTTTC TGCTCTCCGT GGCGCGACTC TGGCCGCTAA ATTCCATGCG
ATATCGGCAG ACCATCTGGA GTACGCGACT GAATCTGATG TCCAGGCTAT GGCAAATGCG
GGTACTGTCG CAGTCTTACT ACCAGGTGCC TACTACTTAT TGCGGGAAAC ACAATGCCCC
CCAATTGATC TGTTCCGCCA GTATAAGGTC CCCATGGCAC TGGCCAGTGA TGCCAACCCA
GGGACATCTC CGGTACTTTC ACTACGCTTG ATGCTCAATA TGGCTTGCAC GTTATTCCGC
ATGACACCAG AAGAAGCACT GGCTGGTGTC ACGTGCCACG CAGCTCAAGC TCTTGGTGTA
CAACAGACTC AAGGTACGTT GGAGACAGGG AAATTAGCTA ACTGGGTGCA TTGGCCCTTA
TCACACCCAG CCGAGTTAGC TTATTGGTTA GGAGGGCAAT TACCTGCCAC TGTCGTATTC
CGAGGAGAAG TACGCCCATG A
 
Protein sequence
MVSVTHCDSL WFGADIITMR GGNYQLIPQG AIAVTGDKIV WIGPHAELPP IHAARQVVYE 
GGLITPGLID CHTHLVFGDD RSNEFEQRLN GVSYAEIAAN GGGIISTVRA TRQASEQQLL
EQALFRLKPL LAEGVTTIEI KSGYGLNLES EIKMLRVARR LGELLPIDVK TTCLAAHALP
PEFIGQPDDY IDVVCNSIIP QVAVENLADA VDAFCEHLAF SPAQVERVFL AAQKAGLPVK
LHAEQLSALR GATLAAKFHA ISADHLEYAT ESDVQAMANA GTVAVLLPGA YYLLRETQCP
PIDLFRQYKV PMALASDANP GTSPVLSLRL MLNMACTLFR MTPEEALAGV TCHAAQALGV
QQTQGTLETG KLANWVHWPL SHPAELAYWL GGQLPATVVF RGEVRP