Gene YpAngola_A1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1871 
Symbol 
ID5800342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1936499 
End bp1937632 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content56% 
IMG OID641339803 
Producthypothetical protein 
Protein accessionYP_001606358 
Protein GI162419831 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00529235 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGAAG TGCGTATTGG ATTGATTGGT ACCGGGTATA TCGGTAAGGC GCACGCCATT 
GCCTACGCAC AGGCACCGAC GGTGTTTGAA TTGCGCGGCA AACTGGTGCG CGAAATGGTG
GCCGAAGTCT CACCAGCGCT GGCGGCACAG CGTGCGCAGG CTTTCGGTTT CAACCGGTTT
ACCTGCGACT GGCGGGAGCT GGTCGCGGAT CCGGCTATTG ATGTGGTTGA TATTTGCTCA
CCTAATTATC TACATAAAGA GATGGCGCTG GCGGCCATCC ACCACGGCAA ACATGTCTAT
GCGGAGAAAC CGCTGGCGCT GAATGCCCGT GACGCCAGCG AGATGGCGGC GGCCGCAACG
CGCGCTGGGG TGAAAACGTT GGTAGGGTTC AATTACATCA AAAACCCCAG TGCGAAGCTG
GCTAAAGAGA TCATTGAACG TGGTGAAATC GGTGAGGTGA TCCACTTTTA TGGCACCCAT
AACGAAGACT ATATGGCCGA TCCCAATACC CCTATTCACT GGCACTGTTT ACACGCGACG
GCAGGGCTGG GAGCACTTGG CGATCTGGCG GCCCACATCG TCAGCATGGC GCAATATCTG
GTGGGGGAAA TAACGCAGGT ATGCGGTGAT CTGAAAACCG TCGTGGTGAC ACGCCCGGCG
AGCGTTGGCT CCAGCGCCAG AGTGGCGGTT GAAAACGAAG ATCAGGCCCA TGCCATGGTG
CGTTTTGTGA ATGGCGCTCA GGGAGTGATT GAAGCCTCGC GGGTGGCTTG CGGGCGCAAA
ATGGGCCTCT CTTACATGAT TACCGGTACT CAAGGGGCGA TCAGTTTTAC CCAAGAACGT
ATGGCGGAAC TCAAACTCTA CCTGCACAAC GACCCGGTCA ACCGACAAGG CTTCCGTACC
CTGCTCGTCG GCCCGGCGCA CCCAGAGTAT GCCGCGTTCT GTATGGCTGC GGGCCACGGT
ATTGGTTTTA ACGATCAAAA AACCGTGGAA GTGCGTGACT TGATCGACGG CATCGCGATG
GACACGCCGC TGTGGCCCGA TTTCGCCGAG GGCTGGAAAG TCTCACGCAT TCTCGATGCG
ATTGCTCTGT CTCATCAGGA TAGCCGCTGG GTGAATGTGA CCGACATTGT CTGA
 
Protein sequence
MKEVRIGLIG TGYIGKAHAI AYAQAPTVFE LRGKLVREMV AEVSPALAAQ RAQAFGFNRF 
TCDWRELVAD PAIDVVDICS PNYLHKEMAL AAIHHGKHVY AEKPLALNAR DASEMAAAAT
RAGVKTLVGF NYIKNPSAKL AKEIIERGEI GEVIHFYGTH NEDYMADPNT PIHWHCLHAT
AGLGALGDLA AHIVSMAQYL VGEITQVCGD LKTVVVTRPA SVGSSARVAV ENEDQAHAMV
RFVNGAQGVI EASRVACGRK MGLSYMITGT QGAISFTQER MAELKLYLHN DPVNRQGFRT
LLVGPAHPEY AAFCMAAGHG IGFNDQKTVE VRDLIDGIAM DTPLWPDFAE GWKVSRILDA
IALSHQDSRW VNVTDIV