Gene YpAngola_A2246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2246 
Symbol 
ID5800716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2350245 
End bp2351243 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content49% 
IMG OID641340146 
Producthypothetical protein 
Protein accessionYP_001606691 
Protein GI162420730 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000358577 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.308003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTA TGATAACGGG GGCTGCCGGT TTTCTGGGGC GTCGATTGAT TGACGGCTTA 
CTGCAACAGG ATTATCTGAC TGATCAGCAG GGGGAGCCGC GCAATATCAA TAAAATCATT
GCCTATGATG TGCTTCCCTT GATGGAAGTA GAGGATCCAC GGGTTCAGGT TGTTTGTGGT
GACATTGCCG ATCAGAACGG ACTATCGGCG GTCTGGGATC GCCAAATTGA CACTATTTTC
CATTTGGCAG CGGTTGTTTC CAGCCAGGCC GAAGACGATT TTGATTTAGG TATGCGCATT
AACGTTGATG CCACTCGCAG CCTATTAGAA TTGGCTCGTC AGTCCGGCCA ATGCCCTAAA
GTGATTATTA CCAGTTCTGT GGCGGTATTC GGCGGTGCAT TGCCGAAAGT TGTTCCTGAT
AATCAGGTGT GGTCACCACA GAGTTCCTAT GGTACGCAAA AAGCATTAAA CGATTTATTG
CTGTCTGATT ACAGCCGCCG TGGGTTTATT GATGGGCGCA GCCTGCGTAT GCCAACCATC
GTGGTGAGAC CCGGTAAACC GAACCGCGCT GCCTCCAGTT TTGCCAGTGG CATTATCCGG
GAACCTCTGC AAGGGCATGA TGCTATTTGC CCCGTCAGTA TGCAAACGCC GTTATGGTTA
CTCTCACCCA AAATGGCGAT CGATAATTTG ATTCATGGCC ATGAACTGGA TGCTAGCCAA
CTCCGCTTAG GGCGGGTGAT CAACTTGCCG GGGCTATCCG TTAGTGTACA GCAGATGATT
GATGCTTTGC GCCGTCTCGC GGGTGATGAG GTCGTTGAAC GTATCCAAGT GCAGCGGGAC
CCTTCGATCG AGAGAATTGT TAATTCCTGG CCAGGTGATT TTGAAGCCAG CTATGGGAAA
GTATTAGGTT TTCGTCGCGA TCCGGATTTT GACAGTATTA TTCAGGATTT TATCATTGAG
AATTTGCCTG AATTAGCGGC AGAGAAGCAC TTTATTTAG
 
Protein sequence
MNIMITGAAG FLGRRLIDGL LQQDYLTDQQ GEPRNINKII AYDVLPLMEV EDPRVQVVCG 
DIADQNGLSA VWDRQIDTIF HLAAVVSSQA EDDFDLGMRI NVDATRSLLE LARQSGQCPK
VIITSSVAVF GGALPKVVPD NQVWSPQSSY GTQKALNDLL LSDYSRRGFI DGRSLRMPTI
VVRPGKPNRA ASSFASGIIR EPLQGHDAIC PVSMQTPLWL LSPKMAIDNL IHGHELDASQ
LRLGRVINLP GLSVSVQQMI DALRRLAGDE VVERIQVQRD PSIERIVNSW PGDFEASYGK
VLGFRRDPDF DSIIQDFIIE NLPELAAEKH FI