Gene YpAngola_A2266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2266 
Symbol 
ID5800736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2375815 
End bp2376843 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content48% 
IMG OID641340163 
Productputative sodium/bile acid symporter family protein 
Protein accessionYP_001606708 
Protein GI162420554 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATGGT TACAACGCTT ACAGATAGAT AAGTTTTTGT TAGTGCTGAT ACTGGTGGTG 
ATTATCGCCT CTATCTTCCC TTGTGAAGGG GAAACCAAAG TGTGGTTTGA AAGGCTGACG
ACCGCCGCCA TTGCTTTGTT GTTCTTTATG CACGGTGCCA AGTTATCACG TGCGGCAATA
ATGACAGGGA TGGGGCATTG GAAGCTGCAT TTGGTGGTTT TTCTCAGTAC GTTTGCACTA
TTTCCCCTAT TAGGGGTGGG GATGAATGTA CTGGTGCCCA ATGTGTTAAC ACCCACGCTA
TATTTAGGGT TTCTCTATCT TTGTGCTTTA CCGGCAACGG TTCAGTCGGC GATTGCGTAT
ACCTCAGTCG CCGGGGGGAA TGTTGCGGCG GCTATCTGTA GCGCATCGGC ATCCAGTATC
CTCGGGGTTT TCCTGTCACC CATTTTAGTT GGCATGTTGA TGCAGACACA GGGCGGTGAT
ACCGACACCT TACATGCTAT CGGCTCCATC ATTATGCAAT TAATGGTACC GTTTGTTGTG
GGGCATCTAT CGCGGCCACT GATAGCTAAA TGGGTTGAAC GGCACAAAAA ACTGGTCAAT
ATCACTGACC GGTCATCTAT TTTGTTGGTG GTTTATGTGG CGTTCAGCGA AGCGGTCGTT
CAAGGGATTT GGAGTCAAAT CGACGGCTGG TCACTTTTGG CCATATCAGT TTGTTCAATG
GTATTACTAA CGGTGGTATT GGTGGTTAAT ACCCTGGCCG CTCGTTGGCT GGGCTTTAAT
ACTGCTGATG AGATTACCAT CGTGTTCTGC GGTTCGAAGA AAAGTTTGGC GAACGGTATT
CCGATGGCTA ACGTGTTGTT TCCTGCCTCA GTCGTGGGGG TTATGGTATT GCCGCTAATG
ATATTCCATC AGATTCAATT GATGGTCTGT GCGGTACTGG CACAACACTA CGCGAAGCGA
ATGGCGAGAG AACAAGCTGA AAAAGGACTG GACGTAATGC CGACAGTAAA TGACAGTAAA
ACTCAGTGA
 
Protein sequence
MSWLQRLQID KFLLVLILVV IIASIFPCEG ETKVWFERLT TAAIALLFFM HGAKLSRAAI 
MTGMGHWKLH LVVFLSTFAL FPLLGVGMNV LVPNVLTPTL YLGFLYLCAL PATVQSAIAY
TSVAGGNVAA AICSASASSI LGVFLSPILV GMLMQTQGGD TDTLHAIGSI IMQLMVPFVV
GHLSRPLIAK WVERHKKLVN ITDRSSILLV VYVAFSEAVV QGIWSQIDGW SLLAISVCSM
VLLTVVLVVN TLAARWLGFN TADEITIVFC GSKKSLANGI PMANVLFPAS VVGVMVLPLM
IFHQIQLMVC AVLAQHYAKR MAREQAEKGL DVMPTVNDSK TQ