Gene YpAngola_A2944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2944 
SymbolthiP 
ID5801416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3098866 
End bp3100473 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content54% 
IMG OID641340791 
Productthiamine transporter membrane protein 
Protein accessionYP_001607321 
Protein GI162421651 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID[TIGR01253] thiamine ABC transporter, permease protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.305825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCC GCCGTCAGCC GTTAATCGCG CCCTGGCTAT GGCCGGGGTT GCTGGCCGCT 
GGGCTGATTG TCGCGGTCGC CTTATTGGCG TTCGCCGCTA TTTGGCACCA TGCGCCCACA
GCAGATTGGC AAAGTGTCTG GCATGACCGT TATCTGTGGC ATGTGATCCG CTTTACGTTC
TGGCAGGCAT TTTTATCCGC CCTGCTGTCG GTGATACCTG CTATCTTTCT GGCGCGGGCG
CTGTACCGCC GCAACTTCCC CGGCCGCCAG TTCATGCTGC GTTTATGTGC CATGACGCTG
GTTCTGCCGG TGTTAGTCGC GCTATTCGGC ATTCTGACGG TCTATGGCCG TCAGGGGTGG
CTGGCTCACC TGTTTGACTG GCTGGGGGTG GATTACCGCT TCTCCCCCTA TGGCTTGCAA
GGTATTTTGT TAGCGCATAT CTTCTTTAAC CTGCCACTGG CAACCCGCTT GTTGTTACAG
TCGCTAGAAG GCATCGCCGT GGAACAACGC CAACTGGCCG CACAACTGGG CATGAACGGC
TGGCAGCATT TTCGCTGGGT AGAGTGGCCT TATCTGCGTC GTCAAATCCT GCCCACTGCC
GCCCTGATTT TTATGTTGTG CTTTGCCAGC TTTGCCACCG TGCTGTCGCT GGGTGGCGGG
CCACAGGCAA CGACCATTGA ACTGGCCATC TATCAGGCAT TGAGCTACGA CTATGATTTA
GGCCGTGCCG CATTATTGGC GCTGATACAA CTGGGCTGCT GCCTGGGTTT AGTGGTACTC
AGTCAGCGGC TGAACAGCGT GTTGACGGTG GGCAATACGC ATGGGCAAAT ATGGCGTAAT
CCGCAAGACA GCGGCTGGGC GCGCCTTGGC GATACCCTGC TGATTTGTAG CGCATTACTC
TTGATGGTAC CGCCATTGCT AACAGTTGTG ATTGATGGCA GTAATAAAAC AATACTGACC
GTGTTACAAC AGCCCGCACT CTGGCAAGCG TTCTTTACCT CAGTGACCAT CGCTATTGGT
GCCGGGCTAT TGTGTGTCAT CTTAACCATG ATGTTGCTCT GGAGTAGCCG CGAACTGAAA
TTACGCCAAC GCGCGGCTTA CGGTCAGGCA TTGGAGGTCA GCGGCATGGT TATTCTGGCG
ATGCCCGGCA TTGTGTTGGC AACCGGTTTC TTCTTGTTGC TGAATGATAC GCTGGGATTG
CCACAATCTC CTTATGGGTT GGTGATCCTC ACCAATGCCC TGATGGCCGT GCCCTACGCA
TTAAAAGTGC TGGATAACCC GATGCGTGAT CTGGCTGAGC GCTACACACC GTTATGTTTA
TCACTGGATA TCCGTGGCTG GCACCGTTTA CGTTATATCG AGCTACGCGC CCTGAAACGC
CCACTGGCTC AGGCGCTGGC CTTTGCCAGT GTGTTATCTA TCGGCGATTT TGGCGTAGTC
GCGTTATTTG GTAATGAACA CTTCCGCACC CTGCCATTTT ATCTTTATCA ACAGATAGGT
TCCTACCGCA GCAATGATGG CGCAGTGACT GCCCTGTTGC TTTTGTTGCT CTGCTTCCTG
CTGTTTACCC TTATTGAACG ACTGCCGAGC CGCCATGCTA AAGCTTGA
 
Protein sequence
MAIRRQPLIA PWLWPGLLAA GLIVAVALLA FAAIWHHAPT ADWQSVWHDR YLWHVIRFTF 
WQAFLSALLS VIPAIFLARA LYRRNFPGRQ FMLRLCAMTL VLPVLVALFG ILTVYGRQGW
LAHLFDWLGV DYRFSPYGLQ GILLAHIFFN LPLATRLLLQ SLEGIAVEQR QLAAQLGMNG
WQHFRWVEWP YLRRQILPTA ALIFMLCFAS FATVLSLGGG PQATTIELAI YQALSYDYDL
GRAALLALIQ LGCCLGLVVL SQRLNSVLTV GNTHGQIWRN PQDSGWARLG DTLLICSALL
LMVPPLLTVV IDGSNKTILT VLQQPALWQA FFTSVTIAIG AGLLCVILTM MLLWSSRELK
LRQRAAYGQA LEVSGMVILA MPGIVLATGF FLLLNDTLGL PQSPYGLVIL TNALMAVPYA
LKVLDNPMRD LAERYTPLCL SLDIRGWHRL RYIELRALKR PLAQALAFAS VLSIGDFGVV
ALFGNEHFRT LPFYLYQQIG SYRSNDGAVT ALLLLLLCFL LFTLIERLPS RHAKA