Gene YpAngola_A0195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0195 
Symbolgph1 
ID5798659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp208800 
End bp210224 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content44% 
IMG OID641338213 
Productsugar (glycoside-Pentoside-hexuronide) transporter 
Protein accessionYP_001604819 
Protein GI162419296 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0477575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGG ACAGCCAGCA CCTCTGTAGC TTGAAGGACG AAGGGGATAA AAGGCTAAAA 
ATAAATATGA ACAATTCAGG CATGAAAATC CCAATGAAAG AGCTGGCGTC TTATTTTGGG
TATGGGATGG GGCAGTGTTT CAGTTTTGGG TTGGTTGGGA CTTTTATTCT CTTTTTCTAT
ACTGACATTA TGGGGATCTC GCCGGTAGCA GCCAGTATGA TCTTTTTAAT TGCCAGGGTA
TGGGATGCTA TACATGACCC ATTAATTGCC GGTGTAATGG ATACTATTAA TATGCGGCGC
GGGAAATTCC GCCCATATTT GCTGTTTACT CCCTTCTTGA TTTTCCTTGT TACGGTCGCT
GCGTTTTATA ATATCGAAGC CAGCTTGATG ACTAAAACGA TCTACGCAGG TGTGACTTAT
ATCCTGTGGG GGACGCTCTA CGCTCTCTCT GATATCCCGT TTTGGTCAAT GAGCACCGTG
ATGACGGATG AACCGCAAGA GCGAGCTAAA ACAGCGACCT GCGCGATGTT GGGCGTGAAT
GCGGGTATCG GCGCTACGAT GATTTTATTC CCTTATATCA GCGGGTTATT TGCTGAGAAC
AGTGCTGATC GTGGTTATTT TGCGGGTGTT GTTATCCTCA TGGTTTTGGG GGTGATACTC
ATGTTGAATG GGTTCTTTAA TACAAAAGAA CGCGTCAACG TCACGGTGAC GGAAAAGGTC
ACGCTAAAGC AGACATTTAT TGTGGTCTGG CAAAATAAGC CGCTATTTTT TATTCTTAGC
GCCTTTTTCA TGAATGTGTT TTCTAACATC GTGAATACTT TTTATATTTT CTTTTTCACT
TATAACATGG GGGATGCTGA GCTGGTTTCT GTTATCGGCT TAATTACATT CACCTGTGCT
TTAGCTTGTC TGGGAACGCC ATTCTTAACC CGCCATTTTA AGAAACGAGA TTTGTTCATT
ACATTATGTG TGTTGGAGAT CATCGCTCGC GTTGGTTTCT GGTTCACCGG TTATAATAAT
GTCGTGTCAG TCATGGTATG GCTAACCGTG ATCACTGCCA TCTTCATGAT GACGAATCCA
CTTATTTCCG CGATGATTGC CGATACTGTG GAATATTCCT ACTATCACAC CGGTAAGCGC
TGCGCGGCCA TCACGTTCTC CGGACAGACT TTTGTCGGTA AATTGTCGGT CGCTGTTGCT
GGCGGTGTCT CTGGCCTGAT CCTGTCAATA TTGGGATATA TGCCTAATGT GGCCCAATCG
ACATGGACAT TGAATGGCCT ATTTTTCTGT ATTTCTCTGT TGCCCGCCGT GGGTGCCGTG
GTGCGTATCC TCATTATGCG TAAATATAAA TTTACCGAAG ATGAACATGC AATTCTTCGT
GAAGAACTGA AACAAGGGAG ATTCCACTCT TCAGTAGGCA AATAA
 
Protein sequence
MSLDSQHLCS LKDEGDKRLK INMNNSGMKI PMKELASYFG YGMGQCFSFG LVGTFILFFY 
TDIMGISPVA ASMIFLIARV WDAIHDPLIA GVMDTINMRR GKFRPYLLFT PFLIFLVTVA
AFYNIEASLM TKTIYAGVTY ILWGTLYALS DIPFWSMSTV MTDEPQERAK TATCAMLGVN
AGIGATMILF PYISGLFAEN SADRGYFAGV VILMVLGVIL MLNGFFNTKE RVNVTVTEKV
TLKQTFIVVW QNKPLFFILS AFFMNVFSNI VNTFYIFFFT YNMGDAELVS VIGLITFTCA
LACLGTPFLT RHFKKRDLFI TLCVLEIIAR VGFWFTGYNN VVSVMVWLTV ITAIFMMTNP
LISAMIADTV EYSYYHTGKR CAAITFSGQT FVGKLSVAVA GGVSGLILSI LGYMPNVAQS
TWTLNGLFFC ISLLPAVGAV VRILIMRKYK FTEDEHAILR EELKQGRFHS SVGK