Gene YpAngola_A2510 details


Gene Information

Locus tag: YpAngola_A2510
Symbol: hutF
ID: 5800980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2626795
End bp: 2628165
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 50%
IMG OID: 641340380
Product: N-formimino-L-glutamate deiminase
Protein accession: YP_001606923
Protein GI: 162420132
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02022] formiminoglutamate deiminase
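
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
start_bp = 2626795
end_bp = 2628165
gene_length_bp = 1371
protein_length_aa = 456


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Both assertions pass: 2628165 - 2626795 + 1 = 1371 bp, and 1371 / 3 = 457 codons, of which 456 encode amino acids.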


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.00394425
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCAGTTT ATTTTACCAA GCGTGCTTTT TTACCTGATG GATGGGCTGC AGACGTGCAA 
ATTGCGGTAG ATGAGCTGGG GGATATACAG CGTATCAGCA CTGGCAGTAG CAGCAGTGGT
TGTCAGGTTT TATCCGGGCC AGTATTACCT GGTATGCCTA ATCTGCACTC TCATGCTTTT
CAGCGCATGA TGTCGGGGCT GGCAGAAGTT GCCGGTGATC CACAAGATAG CTTTTGGACT
TGGCGGGATC TCATGTATCG GTTGGTGCAA CAACTGACGC CGGAACAGGT TGGTGTTATT
GCCCGGCAAC TTTATATCGA AATGCTGAAA GGCGGATATA CCCAAGTTGC TGAATTTCAC
TATTTACATC ATAGCCCTGA TGGTTCCCCT TACAATGACA TAGGGGAAAT GACGGCCCAA
TTGAGTCAGG CGGCACAAGA CGCTGGGATC GGAATGACTT TATTGCCAGT GCTGTATAGC
TACGCAGGAT TTGGTGCTCA ACCCGCACAG CAAGGCCAGA GCCGTTTTAT TCAGGATACC
GAGAGTTATC TCAGACAACA GCAGGTTATC CGTCGGCAGT TGGCTAATCA GCCTTTACAA
AATCAGGGGC TATGTTTTCA CTCATTACGT GCTGTTGAGT TAAGTCAAAT GCAGCAGATT
TTACACGTAT CGGATAAACA GTTACCGGTA CATATTCATA TTGCTGAACA GCAGAAAGAA
GTTAATGACT GCCTGGCATG GTGTGGCCAG CGCCCAGTTG CCTGGCTATA TGGTCACTTA
CCGGTCGATA GCCGCTGGTG CCTTGTCCAC GCGACACATT TGGATACGTC AGAATTGGTT
ATGTTAGCCA ATAGTCAGGC TGTTGCCGGG TTGTGTCCAA CGACAGAGGC CAATTTGGGA
GATGGTATTT TCCCCGGTGT TGACTATATA CATCATCAGG GCCGTTGGGG GATAGGTTCT
GATAGCCATG TTTCGCTGGA TGTTGTGGAG GAGCTACGTT GGTTGGAATA TGGGCAACGC
TTGCGTGATC AACGGCGTAA CCGTTTGACC TGTGAGCGGC AACCCGCCGT GGCGGATCTA
TTGTACAGCC AAGCATTAGC CGGAGGGCGT CAGGCTTGCG GGCGTCAAAT TAGCCAGCTA
GCCGTAGGCT ATCGTGCCGA TTGGTTAGTA CTCGATGGCG ATGATCCTTA TATTGCCGGG
ACAAGATCGG CATCTTTGTT GAATAGATGG TTATTTGCGG GGGGTAAATC GCAAATTCGA
GATGTTTATG TGGCAGGCAA GGCGGTAATC GTGGATAGAT ATCATCCATT GCAACAGCAA
ACTGCGCAAG CTTTTCTGGC TGTACTGAAA GCCTGCCAAC AGGAGGTCTG A
 
Protein sequence
MPVYFTKRAF LPDGWAADVQ IAVDELGDIQ RISTGSSSSG CQVLSGPVLP GMPNLHSHAF 
QRMMSGLAEV AGDPQDSFWT WRDLMYRLVQ QLTPEQVGVI ARQLYIEMLK GGYTQVAEFH
YLHHSPDGSP YNDIGEMTAQ LSQAAQDAGI GMTLLPVLYS YAGFGAQPAQ QGQSRFIQDT
ESYLRQQQVI RRQLANQPLQ NQGLCFHSLR AVELSQMQQI LHVSDKQLPV HIHIAEQQKE
VNDCLAWCGQ RPVAWLYGHL PVDSRWCLVH ATHLDTSELV MLANSQAVAG LCPTTEANLG
DGIFPGVDYI HHQGRWGIGS DSHVSLDVVE ELRWLEYGQR LRDQRRNRLT CERQPAVADL
LYSQALAGGR QACGRQISQL AVGYRADWLV LDGDDPYIAG TRSASLLNRW LFAGGKSQIR
DVYVAGKAVI VDRYHPLQQQ TAQAFLAVLK ACQQEV
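
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The `translate` helper is hypothetical, and only the first 30 and last 15 nucleotides of the gene sequence are used.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 15 nt of the gene sequence above.
first_30 = "ATGCCAGTTT ATTTTACCAA GCGTGCTTTT"
last_15 = "CAACAGGAGGTCTGA"

assert translate(first_30) == "MPVYFTKRAF"  # first 10 residues of the protein
assert translate(last_15) == "QQEV"         # last 4 residues, then the TGA stop
```

Running the same function over the full 1371 nt sequence should reproduce the 456-residue protein listed above; a library such as Biopython could be substituted for the hand-built table.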