Gene YpAngola_A1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1497 
Symbol 
ID5799965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1547659 
End bp1549467 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content48% 
IMG OID641339449 
ProductABC transporter periplamic substrate-binding protein 
Protein accessionYP_001606009 
Protein GI162419645 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTAC GCCTATTTGC TACCTTGATA CTTTCGGCCA TGAGTTTTGG GTTAGCTGCC 
GAAACCATTA AAGAGGGCGT TTCTTTTGCC ATATTGGGCG ACCCCAAGTA CTCATCGGAT
TTTAGCCATT TTGATTATGT CAATCCAGCG GCCCCTAAAG GGGGAAATAT TACTCTGTCG
GCGATTGGCA CATTTGATAA TTTCAACCGG TATGCATTGC GTGGTACCTC TGCAGCTCGA
ACGGAGAGAC TCTATGATTC ATTGTTCAGT CCCTCAAGCG ATGAGATTGG CAGCTACTAT
CCATTAATTG CGGAATCAGC GCGCTATTCT CCCGACTTCA GTTGGATTGA GGTTGATATC
AACCCCCACG CCCGTTTTCA TGATGGTGCC CCGATTACTG CCGAGGATGT GGCCTTCACC
TTCAACAAAT TTATGACTGA AGGGGTACCA CAATTTCGAT TGTTCTATAA AGGCGTTAAA
GTCAACGCTA TTTCCCGTCT GACGGTACGT TTTGAATTTC CTGAACCTAA CAAAGATAAA
ATGCTCGGCC TACTTGGTTT TCCGGTGATG CCCAAACATT TCTGGAAAGA TCATAAGCTG
AGTGACCCAC TCAGTACCCC TCCAGTCGCC AGCGGCCCTT ACCGCATCAG CAAATACAAA
ATGGGCCAAT ATGTCACTTA TGAACGGGTA AAAGATTACT GGGCCGCCAA TCTGCCGGTT
AACCGGGGCC AATATAATTT TGACACTATC CGTTATGATT ATTATCTGGA CGACAAAGTC
GCCTTAGAAG CCTTTAAAGC CGGAGCCTTT GATTTACGGA CGGAGTCCTC ACCGAAAAGC
TGGGCGACGC AGTATGCGGG CGGTAACTTC GCCAAGAACT ATATCGTCAA ACAGGATATT
ACCGACAACT CGGCACAGAA CACTCGCTGG TTAGCATTTA ATGTGCAACG GCCCCTCTTC
AGTGACCGAC GCGTCCGCCA GGCGCTGACT TTGGCCTTCG ATTTTGAATG GATGAACAAA
GCCTTTTATT TCAATAGCTA TCAGCGTGCC AATAGCTTCT TCCAAAATAC CGAATATGCG
GCAACAGGTT ATCCGGACAG TGCAGAACTG GCTTGGCTCG CGCCATTGAA AGATAAAATA
CCCGCGGAAG TTTTCACCCA GATTTATCAA CCACCACACA CGGATGGTAG CGGCTATTCG
CGAGATAACC TGCTCAAAGC CAAAGAGTTA CTGAATGAAG CGGGCTGGGA AGTCAAAGAC
CAACAACTGG TGAATAGTAA AACCGGTAAA CCTTTTGAAT TTGAATTGCT ATTACCCAGC
GGCAGCGATT TTCAGTATGT GCAACCGTTC AAACACAATT TGCAGCGTTT AGGTATCACG
ATGAAGATCC GCGAAGTTGA CAGCTCGCAG TTTATTAATC GCTTACGCAG CCGGGATTTC
GATATGATCC CAACGGTTTA CAACGCCTTC CCTTACCCAA GCCCAGACCT GCAAATCTTG
TGGAGTTCAG CCTACATTGA TTCCACCTAT AACAGACCCG GTATTAAAGA TCCGGCCATC
GACCAATTAA TCGCCCAGAT TGTCAGCCAT CAGGATCAAC CAGAGGCCTT ACTCTCTCTG
GGGCGTGCCC TTGACCGGGT ATTGACCTGG AATTATTTGA TGATCCCGAT GTGGTATTCC
AATCATAGCC GCTTCGCCTA TTGGGATAAA CTCTCGATGC CAGCGGTTCG TCCCACCTAT
TCGCTAGGGT TTGATAGCTG GTGGTTTGAT GTTAACAAGG CAGCTCGCCT GCCAGTAGAA
CGTCGTTAG
 
Protein sequence
MFLRLFATLI LSAMSFGLAA ETIKEGVSFA ILGDPKYSSD FSHFDYVNPA APKGGNITLS 
AIGTFDNFNR YALRGTSAAR TERLYDSLFS PSSDEIGSYY PLIAESARYS PDFSWIEVDI
NPHARFHDGA PITAEDVAFT FNKFMTEGVP QFRLFYKGVK VNAISRLTVR FEFPEPNKDK
MLGLLGFPVM PKHFWKDHKL SDPLSTPPVA SGPYRISKYK MGQYVTYERV KDYWAANLPV
NRGQYNFDTI RYDYYLDDKV ALEAFKAGAF DLRTESSPKS WATQYAGGNF AKNYIVKQDI
TDNSAQNTRW LAFNVQRPLF SDRRVRQALT LAFDFEWMNK AFYFNSYQRA NSFFQNTEYA
ATGYPDSAEL AWLAPLKDKI PAEVFTQIYQ PPHTDGSGYS RDNLLKAKEL LNEAGWEVKD
QQLVNSKTGK PFEFELLLPS GSDFQYVQPF KHNLQRLGIT MKIREVDSSQ FINRLRSRDF
DMIPTVYNAF PYPSPDLQIL WSSAYIDSTY NRPGIKDPAI DQLIAQIVSH QDQPEALLSL
GRALDRVLTW NYLMIPMWYS NHSRFAYWDK LSMPAVRPTY SLGFDSWWFD VNKAARLPVE
RR