Gene YpAngola_A2505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2505 
Symbol 
ID5800975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2621394 
End bp2622410 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID641340376 
ProductNAD-dependent epimerase/dehydratase family protein 
Protein accessionYP_001606919 
Protein GI162421220 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00000697027 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGTCT TAGTGACAGG GGCAACCAGT GGATTGGGGC GCAACGCAGC ACAATGGCTG 
TTGGAGGCAG GGCACGAGGT CTATGCTATC GGGCGGGATC AACTGGCGGG GGAAGAACTG
CGCAAGTTGG GCGCCACCTT TATCCCGCTG GATCTGACCA TGACCACGAT GGAGGTCTGC
CAGCAATGGC TCAAAACGTG CGATGTGGTT TGGCACTGTG CCGCCAAATC AGCGCCGTGG
GGTAACCCAC AGGATTTTCA CCAGACCAAT GTGGTGGTGA CGCATAAACT TGCTCAGGCC
GCCGGTCGCG AGGGGGTTAA ACGGTTCATT CATATCTCAT CACCTGCCGT GTATTTCGAT
TTTCGCCATC ATCATGATCT GCCTGAAACA TACCGGGCCA GCCGCTTTTC CAGCCACTAT
GCCAGCAGTA AATATGCGGC GGAGCAGGTG TTGCATGAAT GTATCGCTCA TTATCCGGAC
ACCACGTATG TGATCCTGCG CCCACGTGGG TTATTTGGTC CCCACGATAG GGTGATTGTG
CCGCGTTTGC TGCAACAACT GAGCAGGGAT CGCAATGTGC TGCGTTTACC GGGAGGAGGG
CAAGCGCAGC TTGATCTCAC GTTTGTGTTG AATGTGGTAC ATGCCATGAT GTTAGCCACT
GATAACGACG GGCTACGCTC CGGTGCGATT TATAATATTA CCAATCAGGA GCCACAACGG
CTGGTGACGA TGTTGGATTC TCTGTTGAAT CAGCAGCTTC ACATTAACTA TACCTTGCAG
CCGGTGCCCT ACTCGTTACT TTCTGTCGTG GCGGCAGGGA TGGAACTGGT CGCCAGTATG
ACCCAGAAAG AACCACTGTT AACCCGGTAC AGTGTTGGCG CAGTCTATTT TGATATGACC
CTTAATTCAG AACGTGCCAT TAATGAACTG GGTTACCGGC CTCGTTACTC GATGGCGGAG
GGGATTGTGC TGGCTGGCGA GTGGCTTAGC GCGCAGAGGA GTGGCCAGCA TGGCTAA
 
Protein sequence
MKVLVTGATS GLGRNAAQWL LEAGHEVYAI GRDQLAGEEL RKLGATFIPL DLTMTTMEVC 
QQWLKTCDVV WHCAAKSAPW GNPQDFHQTN VVVTHKLAQA AGREGVKRFI HISSPAVYFD
FRHHHDLPET YRASRFSSHY ASSKYAAEQV LHECIAHYPD TTYVILRPRG LFGPHDRVIV
PRLLQQLSRD RNVLRLPGGG QAQLDLTFVL NVVHAMMLAT DNDGLRSGAI YNITNQEPQR
LVTMLDSLLN QQLHINYTLQ PVPYSLLSVV AAGMELVASM TQKEPLLTRY SVGAVYFDMT
LNSERAINEL GYRPRYSMAE GIVLAGEWLS AQRSGQHG