Gene YpAngola_A3029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3029 
Symbol 
ID5801501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3200341 
End bp3201417 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content48% 
IMG OID641340866 
Producthypothetical protein 
Protein accessionYP_001607396 
Protein GI162419303 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000284319 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0136248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TCGCTTACCC TATATTGGCT TTGAGCCTAC TCACAGCACT GCCGGTATTT 
GCTGCTGACC CCGTCACATT AGCTCCGGTA CCGGACGCTA TCGCCAACCA TCAAGGTCAG
ATTAAAATTG CGGTAATCCG TAATTTAGGC TCCGATGACA ACACGACACA ATTCCTGTCC
GGTGTATTAA AAGAAGGCAA AAAGCTGGGC TTCAAAGTGG ATACGTTTCT GAGTAACGGC
GACGACGCTC GCTTTCAGGA CTTTGTCAAT CAGGCTATTA GCCAGAAATA CGACGGTATC
ATTCTTTCTC AGGGCCGCGA CCCCTACTCA ACAGAGTTGG TAAAGCGCAT TACCGATAAT
GGAATTGCCG TATCGGTATT TGACACTGCT ATTCAGGGCG AGATTCCGGG GCTGACAGTC
ACTCAACAGG ATGACGCCTC ATTAACCAAT GAATCCTTCG GTCAGTTAAT CAAAGACTTC
GATGGTAACG CCAATATTAT CAAGCTTTGG GTCGCAGGTT TCCCACCAAT GGAAAGGCGC
CAAGCCGCTT ATCAAGCATT GCTAAAGCAG AATCCTGGGA TTATTGAACT GGAATCTATC
GGTGCAGTTT CCTCTGATGT TCAGGGTGAT ACCGCCAATA AAGTCGGCGC GGTATTGGCG
AAATACCCAA AAGGTAAAAT AGATGCAATC TGGGGAACTT GGGATGCCTT CACACAGGGC
GCTTATAAAG CCTTACAAGA AAATGGCCGC ACTGAGATAA AACTGTATAG CATTGATATT
TCGAATCAAG ACTTACAACT TATGCGCGAA GCTAATAGCC CATGGAAAGT CAGTGTTGCG
GTAGATCCTA AGCTAATTGG CGCCATTAAC CTGCGCCTGA TTGCCAAGAA AATTGCGGGG
GAAACCACAC CTGCCAGCTA TGAATTCCGC GCGGCTTCCA TTCCTCAGGC ATTGCTAGCC
AGCCAACCTG GCCCGGTTAA TGTGGCGGGT TTAAGTAAGA TTATCCCCGG TTGGGGTCAG
TCTGATGATT TCAACTCTCC TTGGTTTGCG ACCCTTGCAG CCCAAAATGG TCAATAG
 
Protein sequence
MKKIAYPILA LSLLTALPVF AADPVTLAPV PDAIANHQGQ IKIAVIRNLG SDDNTTQFLS 
GVLKEGKKLG FKVDTFLSNG DDARFQDFVN QAISQKYDGI ILSQGRDPYS TELVKRITDN
GIAVSVFDTA IQGEIPGLTV TQQDDASLTN ESFGQLIKDF DGNANIIKLW VAGFPPMERR
QAAYQALLKQ NPGIIELESI GAVSSDVQGD TANKVGAVLA KYPKGKIDAI WGTWDAFTQG
AYKALQENGR TEIKLYSIDI SNQDLQLMRE ANSPWKVSVA VDPKLIGAIN LRLIAKKIAG
ETTPASYEFR AASIPQALLA SQPGPVNVAG LSKIIPGWGQ SDDFNSPWFA TLAAQNGQ