Gene YpAngola_A1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1754 
Symbol 
ID5800225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1816554 
End bp1818200 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content49% 
IMG OID641339688 
Productputative transporter 
Protein accessionYP_001606243 
Protein GI162419277 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00000362629 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGAACA ACCCGAGAAA AGATATACCG CTGATTGTCA CCAGTTTGCT CTGCATCATT 
CTGATTGCTT TGGCATTCAG CTTATTTCCG CAGCAAAGCA GCCAAATGGC AGAGACCATT
TTTAGTGGCG TTACCCGGCT ACTGGGATCG ACGATCCAAA TATTAGTGCT GCTTGGCTTA
ATATTGGTAC TTTATATCGC GGTGAGCAAA TACGGCAATA TTCGTCTGGG TGAGGGAAAG
CCTGAGTATC GTACTATCCC ATGGTTGTTT ATGTTTATTT GCGCGGGACT TGGTTCATCA
ACCCTGTATT GGGGGGTGAT GGAGTGGGCT TATTACTATC AGACACCGGG TCTGAATATT
GCGCCAAAAA CGCCGAAGGC ACTGGAATAC AGCGTCAGTT ACTCCTTCTT TCATTGGGGG
ATCAGTGCGT GGGCAACTTA TGCGTTGGCC TCGCTGATTA TGGCTTACCA TTTTCATGTC
AGAAAAAACA AAGGGTTAAG TTTATCGGGG ATTATCTCGG CCATTACCGG GGTCAACCCG
CAGGGTTTTT GGGGGCGTCT GGTCGATCTG ATCTTTTTAA TCGCGACAAT AGGGGCGCTG
ACGATCTCAT TAGTGATCAC CGCCGCCACA TTTACCCGTG GACTGACTGC GCTTATCGGC
GTGCCTAATA ACTTTGCGGT GCAGGCTGGC GTGATGTTGG TAGCGGCAGT TATCTTTTCC
TTTAGTTCGT ATATCGGTAT TGATCATGGT ATGCAGCGCC TGAGCAAAAT GGTGGGATGG
GGCGCTTTTG GTTTTGCCGT TCTGGTGCTT TTAGTGGGGC CAACTGAATT CATTATCAAT
AACACCATTA ACTCCATCGG ATTAACCACA CAAAATTTCC TGCAAATGAG CCTGTTCACC
GACCCTATGG GGGATGGCCG CTTTACCCGC AGTTGGACGG TCTTTTATTG GTTATGGTGG
ATTTCATATA CCCCCGGTGT TGCGATGTTT GTGACGCGGG TTTCACGTGG TCGCAAACTC
AAAGAGGTTA TATGGGCTTT GCTGCTGGGC AGTTCGTTGG GTTGCTGGTT CTTCTTTGGG
GTACTGGAAA GCTACGCGAT ACAACAGTTT ATTAGTGGTG CGTTAGATGT TCCCCTCATC
CTAAATACTC AGGGCGGGGA AACTGCGGTA CAAATGCTGC TCAGCGCGCT ACCCATGGGC
AAATTATTCC TTGCTGCCTA TCTGTTCATC ATGATTATTT TCCTGGCTTC ACATATGGAT
GCGGTGGCGT ACTGTATGGC GGCAACCAGC ACCCGTAATC TGCGGGAAGG CGAAGATCCT
AATCGTATGC TGCGGCTGTT TTGGTGTGTA GTGATTACAC TGATCCCTCT CTCCATTCTG
TTTGCTGGTG CCTCGTTGGA TACCATGAAA ACGACGGTTG TACTCACTGC GCTGCCTTTT
TTATTGATTC TCTTGATCGA GGTTTATGGC TTTGTCCGCT GGATTAAGCA GGATTATGCC
CATGTTCCCG CGCACCTTAT TGAGCAAAGT ACCCCGCAAG TCATGTTGCC GGCTGAATCT
TTACCGCCTG CTACTGCGTC AATGTTAGTC GTGGAAACTT TAGTCGTGGA AACTGCGGCG
CCACCCGCCA GTAAGCATAA CCAGTAA
 
Protein sequence
MLNNPRKDIP LIVTSLLCII LIALAFSLFP QQSSQMAETI FSGVTRLLGS TIQILVLLGL 
ILVLYIAVSK YGNIRLGEGK PEYRTIPWLF MFICAGLGSS TLYWGVMEWA YYYQTPGLNI
APKTPKALEY SVSYSFFHWG ISAWATYALA SLIMAYHFHV RKNKGLSLSG IISAITGVNP
QGFWGRLVDL IFLIATIGAL TISLVITAAT FTRGLTALIG VPNNFAVQAG VMLVAAVIFS
FSSYIGIDHG MQRLSKMVGW GAFGFAVLVL LVGPTEFIIN NTINSIGLTT QNFLQMSLFT
DPMGDGRFTR SWTVFYWLWW ISYTPGVAMF VTRVSRGRKL KEVIWALLLG SSLGCWFFFG
VLESYAIQQF ISGALDVPLI LNTQGGETAV QMLLSALPMG KLFLAAYLFI MIIFLASHMD
AVAYCMAATS TRNLREGEDP NRMLRLFWCV VITLIPLSIL FAGASLDTMK TTVVLTALPF
LLILLIEVYG FVRWIKQDYA HVPAHLIEQS TPQVMLPAES LPPATASMLV VETLVVETAA
PPASKHNQ