Gene YpAngola_A1148 details


Gene Information

Locus tag: YpAngola_A1148
Symbol: kdsD
ID: 5799613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1187755
End bp: 1188783
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 51%
IMG OID: 641339124
Product: D-arabinose 5-phosphate isomerase
Protein accession: YP_001605694
Protein GI: 162418897
COG category: [M] Cell wall/membrane/envelope biogenesis; [T] Signal transduction mechanisms
COG ID: [COG0794] Predicted sugar phosphate isomerase involved in capsule formation; [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains
TIGRFAM ID: [TIGR00393] KpsF/GutQ family protein
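
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1187755
end_bp = 1188783

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1029 342
```

Under these assumptions the result agrees with the listed gene length (1029 bp) and protein length (342 aa).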


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGATG ACGGTAATAC CAACATTGGG CGGATATTAA GTATGTCTAC TTTTGATTTA 
CAGCCGGGCG TGGATTTCCA GCAGGCAGGT AAGCAGGTAC TTCAGATTGA ACGCGAAGGG
CTGGCTCAAC TCGATCAATA CATAAACGAG GATTTCTCCA GAGCCTGTGA GGCCATTTTT
CGTTGCCACG GCAAAGTCGT CGTGATGGGA ATGGGCAAAT CTGGTCATAT TGGCTGCAAA
ATTGCAGCCA CATTCGCCAG CACGGGCACA CCTGCATTTT TTGTCCATCC TGGTGAAGCC
AGCCATGGCG ATTTGGGCAT GATCACGCCA CAAGATATTG TATTAGCTAT CTCTAACTCT
GGGGAATCCA ACGAAATCCT CACGTTGATC CCCGTGCTTA AGCGCCAGAA AATTCTGTTG
ATTTGCATGA GCAGTAACCC TGAGAGCACC ATGGGTAAAG CGGCCGATAT TCACTTGTGC
ATTAATGTGC CACAAGAGGC CTGTCCGCTG GGGCTAGCGC CAACCACCAG TACGACTGCA
ACACTGGTTA TGGGGGATGC GTTGGCGGTA GCCTTGCTCA AAGCACGGGG TTTCACGCAG
GAAGATTTCG CACTCTCTCA CCCAGGTGGC GCGCTGGGGC GTAAGTTGCT GCTGCGGATC
AGCGATATTA TGCATACGGG TACGGAGATC CCCACCGTCA GCCCTGATGC ATCATTACGT
GATGCCTTGC TAGAAATTAC TCGGAAAAGT CTGGGTTTGA CCGTTATTTG TGACGATTCA
ATGAGGATTA AAGGTATCTT TACCGACGGT GACTTGCGCC GGGTATTTGA TATGGGCATT
GATCTGAATA ATGCAAAAAT TGCTGACGTC ATGACTCGCG GGGGTATTCG AGTCCCTCCG
AACATATTGG CGGTGGACGC GCTCAATCTG ATGGAGTCAC GCCATATCAC TGCGTTGCTC
GTCGCTGATG GTGACCAATT ACTGGGTGTC GTACATATGC ACGATATGCT GAGAGCCGGT
GTTGTCTGA
 
Protein sequence
MKDDGNTNIG RILSMSTFDL QPGVDFQQAG KQVLQIEREG LAQLDQYINE DFSRACEAIF 
RCHGKVVVMG MGKSGHIGCK IAATFASTGT PAFFVHPGEA SHGDLGMITP QDIVLAISNS
GESNEILTLI PVLKRQKILL ICMSSNPEST MGKAADIHLC INVPQEACPL GLAPTTSTTA
TLVMGDALAV ALLKARGFTQ EDFALSHPGG ALGRKLLLRI SDIMHTGTEI PTVSPDASLR
DALLEITRKS LGLTVICDDS MRIKGIFTDG DLRRVFDMGI DLNNAKIADV MTRGGIRVPP
NILAVDALNL MESRHITALL VADGDQLLGV VHMHDMLRAG VV
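
The gene and protein sequences above can be compared by translating the nucleotide sequence with translation table 11, which this record lists. The sketch below is illustrative only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino-acid string, and an alternative start codon such as GTG is assumed to be read as methionine at the first position. It also shows how a GC fraction like the one listed in the record could be computed.

```python
# Standard codon layout used by NCBI genetic-code tables (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons of translation table 11; assumed to be read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "GTGAAAGATG ACGGTAATAC CAACATTGGG"
print(translate(first_blocks))  # prints: MKDDGNTNIG
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the listed GC content.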