Gene YpAngola_A1907 details

Gene Information

Locus tag: YpAngola_A1907
Symbol: hemB
ID: 5800378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1979631
End bp: 1980653
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 48%
IMG OID: 641339833
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_001606388
Protein GI: 162420498
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
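
The coordinates and lengths in the Gene Information table can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 1979631
end_bp = 1980653
gene_length_bp = 1023
protein_length_aa = 340

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon and is not translated.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the values reported in the table.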


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.00228937
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCTATG CATTTCCGGG CACCTTTCCT GGTCGCCGTA TGCGCCGTGT ACGCTGTCAT 
GATTTCAGCC GCCGTTTGGT TGCCGAGAAT CACTTGACGG TCAATGACTT GATTTACCCG
GTGTTTGTCA TGGAAGGGAC TCACCAGCAA CAAGCCGTTT CTTCAATGCC GGGGGTCTCT
CGTATGACAA TTGATCTGCT GCTGAAAGAA GCAGAAGCAA TTGCTAAGCT CGGTGTGCCG
GTTATATCGC TATTCCCGGT CATTGAAGCA GGGAAAAAAT CATTATATGC AGAAGAGGCA
TATAACCCGG ATGGTTTAGT TCAGCGTACT GTGCGTGCGT TGAAAGATGC AGTTCCTGAG
CTAGGTATTC TGACTGACGT GGCACTCGAT CCCTATACGA CCCATGGTCA AGATGGTGTG
ATTGATTCTG ATGGTTACGT GATCAATGAT GTCACCAAAG AGATTCTAGT GCGTCAGGCA
CTATCGCATG CTGAAGCTGG CGCTGAGATT ATTGCACCAA GCGATATGAT GGATGGCCGT
ATAGGTGCAA TTCGTGATCA ATTAGAGCGC CAAGGGTTAG TGAACACCCA GATAATGGCT
TATTCGGCTA AGTATGCGTC TTGCTACTAT GGCCCATTCC GCGATGCAAT TGGCTCGAGC
AGTAATCTAA AAGGGGGGGA TAAAAAAACT TATCAGATGG ATCCCGCCAA CAGTGATGAA
GCACTGCAAG AAATAGCTCA AGATTTACAA GAAGGCGCTG ATATGGTGAT GGTAAAACCA
GGAATGCCAT ATCTGGATGT TGTACGCAGG GTGAAAGATA CTTTTGGTGT TCCAACCTTT
GCCTATCAGG TGTCTGGTGA ATATGCCATG CATATGGCAG CGATTCAGAA TGGCTGGCTC
CAAGAGAAAC CGACGGTGAT GGAGTCGTTG TTATGCTTTA AGCGTGCAGG TGCGGATGGC
GTATTAACTT ATTTCGCAAA GCAAGTTGCG CAATGGCTAC ATGATGACCA GATGCAGCGC
TAA
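
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before it can be measured. A minimal sketch of that step follows, applied only to the first line of the sequence for brevity. The helper names `clean_sequence` and `gc_count` are hypothetical, introduced here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the remaining letters."""
    return "".join(text.split()).upper()


def gc_count(seq: str) -> int:
    """Count the G and C bases in an already cleaned sequence."""
    return sum(1 for base in seq if base in "GC")


# First line of the gene sequence, copied as displayed.
first_line = "ATGAGCTATG CATTTCCGGG CACCTTTCCT GGTCGCCGTA TGCGCCGTGT ACGCTGTCAT"

seq = clean_sequence(first_line)
print("bases:", len(seq))
print("G+C bases:", gc_count(seq))
print("GC fraction:", round(gc_count(seq) / len(seq), 3))
```

Passing the full multi-line sequence to `clean_sequence` in the same way gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.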
 
Protein sequence
MSYAFPGTFP GRRMRRVRCH DFSRRLVAEN HLTVNDLIYP VFVMEGTHQQ QAVSSMPGVS 
RMTIDLLLKE AEAIAKLGVP VISLFPVIEA GKKSLYAEEA YNPDGLVQRT VRALKDAVPE
LGILTDVALD PYTTHGQDGV IDSDGYVIND VTKEILVRQA LSHAEAGAEI IAPSDMMDGR
IGAIRDQLER QGLVNTQIMA YSAKYASCYY GPFRDAIGSS SNLKGGDKKT YQMDPANSDE
ALQEIAQDLQ EGADMVMVKP GMPYLDVVRR VKDTFGVPTF AYQVSGEYAM HMAAIQNGWL
QEKPTVMESL LCFKRAGADG VLTYFAKQVA QWLHDDQMQR
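
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for ordinary codons (the two tables differ in their permitted start codons). The `translate` helper is hypothetical and handles only unambiguous A, C, G, T input.

```python
# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate whole codons from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein.
gene_start = "ATGAGCTATG CATTTCCGGG CACCTTTCCT GGTCGCCGTA TGCGCCGTGT ACGCTGTCAT"
protein_start = "MSYAFPGTFPGRRMRRVRCH"

print(translate(gene_start) == protein_start)
```

The same function applied to the complete gene sequence should stop at the terminal TAA codon, leaving a product that can be compared with the listed protein sequence.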