Gene YpAngola_A1519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1519 
Symbol 
ID5799987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1569242 
End bp1570330 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content55% 
IMG OID641339468 
Productputative ribose ABC transporter periplasmic ribose-binding protein 
Protein accessionYP_001606028 
Protein GI162421558 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.11554 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAG CAACGCAAGC GGGCGTGAGC CGACGCGGCC TGTTGGTTGG TGGTGCGGTA 
CTGGGATTGG GCATGGTGAT CGGGCGAGGA GCATTGGCAC AAAACCGCCT TTCGATCCCT
TACTCGAATA AAAGCCTCGA TTACTACTTC TTTGTGATTC AGGAAGCATC AGTGAAACGC
GCGGTTCTGG CACGTTCGGG GGAGTTTCAG GCGACTAACG CCAATTTTGA CAATACCCGC
CAACTCGAAC AGTGGCAAAG CCTACTGTTG TCCCGCCCTT CAGCGATTAT TTCGGACCCC
ATTGACAGCC AGGCCATTGT TTCTGCTATT CGGCGCTACA ACCGGGAGAA GATACCGGTC
GGCATCATCG ATACCCCCGC AGACGGTGGG GATGTGGCGA TTACCGTCAG CTTCGATAAT
TTTCAAGGTG GCGTCATGGC GGCGGAAGAG ATCGTTAACC GCCTGATCAC CCGATATGGC
CGCCCGAAGG GAACGGTGCT GAATTGTTAT GGCGCTCTGG CCTCAGTAGC CTGGCGCTTG
CGTAAAGAGG GGATGGATTC GGTTTTTGCC AAATATCCGG GCATCACCTA TTTGGCTCGT
CCTACCGATG GTCAGCTCGA AAAAATGCTT TCGGTCACCT TGTCCACGCT GTCGGAATAC
CCCGACCTTG ATGCTGTTCA TGCCCCTTCG GATTCCCCCT CTCGCGGTAT CGTCACTGCG
CTGCAACAGA AAGGCCGCTG GAAGAAGATA GGAGAGGCGG GACACGTCAT CTTCGTTAAC
ATTGATGGAG AACCTATTGC GCTGAAATGG ATTCAGGACG GCTATATGGA TGCTTGCATA
TCTCAGGATC CGGTGGCCTA TGGCGAGATT GCTGTCGATA TGATCGTCAA GCACGCGCTA
AAAGGTGAAG CTGTCCCGCT GGGCACTTAT CAAGACAAAA AATACTTCTG GGAGTCGGGC
GAGATTGTCC AAGGCAAGAC CGGGCCGACG CTCATCATCC CCGCCTTCGT GATCAATCGT
GACAATGTGC AGGATCCGCG CCATTGGGCC GCGGTGGCGG AGAAGACTTG GGGCATCCCC
TATACCTGA
 
Protein sequence
MTKATQAGVS RRGLLVGGAV LGLGMVIGRG ALAQNRLSIP YSNKSLDYYF FVIQEASVKR 
AVLARSGEFQ ATNANFDNTR QLEQWQSLLL SRPSAIISDP IDSQAIVSAI RRYNREKIPV
GIIDTPADGG DVAITVSFDN FQGGVMAAEE IVNRLITRYG RPKGTVLNCY GALASVAWRL
RKEGMDSVFA KYPGITYLAR PTDGQLEKML SVTLSTLSEY PDLDAVHAPS DSPSRGIVTA
LQQKGRWKKI GEAGHVIFVN IDGEPIALKW IQDGYMDACI SQDPVAYGEI AVDMIVKHAL
KGEAVPLGTY QDKKYFWESG EIVQGKTGPT LIIPAFVINR DNVQDPRHWA AVAEKTWGIP
YT