Gene YpsIP31758_1640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1640 
Symbol 
ID5387794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1906884 
End bp1909748 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content51% 
IMG OID640864621 
Productputative integral membrane protein 
Protein accessionYP_001400617 
Protein GI153950864 
COG category[S] Function unknown 
COG ID[COG5373] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTATT GGCTGTTAAT TTTTTTACTT ATCTTCTTGT TTGTCATCTA TATGCAAATA 
AGTGCGTTAT CCAACTCGGT AAAATATATA CGAGAAGAGA TAGGCAGGCT GCGCAATGAG
CTTGCTCCAC GCTTGCCTGA ACCTACCGTG GCCACTGTTG AGGAAGAGAA CACGCCAACG
CATTTGGCAG ATGAAGTGGC CGTGTCCAAG GTATGGACCC CTGATTACGC AACCTATCAG
AATTATCTAC TCACCAAAAA GCAAAGCCAG ATTAAAGAGA AGGCTCTTGC CGATTCAGTA
AAAAAAGAAC CCGCCCGGCC AATAGAATAT AGGATTGCTG ACCGTCTGGT GGTGCTTAAG
CCACAGGATG AGGCCCCTCA AGAATCGCCG AAAATGGATG CAGGCCCTGC CAGTGGTCAT
CCACAGAAAC CAGACGTGTG GACAAAATTG GAAAATGTCC TGTTTTCCTT AAAATCACCG
GAGAGTTTCT GGAAACGCCT CGAGCAAGAG TTTGCCGCAC GTTGGATGGT GTGGATCGGC
GGTGTGATCA TGGCACTGGG TGTGATTTTC CTGCTGAAAG CAGGCAGCGA ACAGGGTCTG
TTCGGGCCAA CCTTGCGGAT TGGTACCGCC ATTGTGCTGA GTGTGCTGAT GGTTGTTGGG
GGGGAGTGGC TCAGGCGTTC ACGTTTTCAG TTGCCCTTGG TCAATGCCGG TTATATCCCG
GCGGCGCTGA GTGGGGCCGG TATTCTCGGT CTGTTTGCCT CAATGCTGGC GGCTAAATAT
CTCTACGACA TGTTCCCGAT GCCCGTTCTC ATGCTATTGC TCAGCCTGAT TTCTCTGTTG
GCGATGGTGC TGGCATTGTG GCAGGGGCCA TTTATGGCGG CCTTGGGGCT GCTTGGCGCC
TATACCGTCC CGTTATTTGT TTCGACGGGC AGCGGTAATG TCACCGGGCT GTTGGCCTAC
CTGTTCCTGG TTAGTTTGGC GTCTATCGGC CTGATGAGCC GTGTCTATCG GCACTGGCTA
TGGCTGGGGG CGATGGTCGG AAACTATCTA TGGCTTTTGG TTTCTTTATT CATAGCAACG
TCCGAACACC AGTTAGCGCG TTCACTATTC CTGCTAGCAA CCACATACGG CTTCCTAGCC
TGGCCACATC TGGGGTGGGC GCTAAAAAGC AAAATACGTT TTCGCCATAA TGACTGGAAA
AACTGGCCCC CGGCTTTGCT GGATCCGTTA TTAACTGGGT TGATTGCCGG GCTTTCGCTA
TTAGCCTTAA CGTTAAGCGC AAATCATTCG ATCCTGGTCT GGAGCACGTT GGTATTCTCC
TGTATTAGCC TGTTACTGAT TGCCAGACGC TCACCTTCAT TGGATTTATT TGTTCCACTG
GCCGCGATAT TAGCCATTTC TCCCTTTATT AATGCAGGAA CGCTGTTTGA GCCAGAAACC
GTCTGGACGA GTCTGCTTGC GCTCAGCACA TTAGCGCTAT TCTGCGGGAT ATATGGTAGC
TACCAGGTCA CCCGGCCGGT TTACCGCAAG CTGTGGTGGG CGTTATTGTC GGTGCTGGTG
CCAATGTATC TCGTTGTAAG TACCTATTAT GCCTGTCGCG ATAATTTTGA GCTGGCTCCA
GTTAAATACA CACTGCTGGC GGTTTGTCTG GCTATGTATA TCGCGTGGAA TGCATTCAGT
GATGGGTATC GCCGTGGCTC AGCGCTGTTG CGGACGATCT ATCAGGTCGC CGCCCAACTG
ACGTTGACCT TTGCATTATT TATGGCTTTC AGCCAGGTTA CGCTAACCCT GCTGCTCGCG
GGTCAGCTTT TAGTTTTGGT ACTCTGGCAA CGTTGGCGAC AGGTTTCGTT ACTGTCTTGG
ATTTTGAAAC TGCATGCGAT CATACTGCTG GTTCGCCTGT CAATGAACCC GTATATCCTT
GGATATTCCT CGCCGTTGCA CATTGGTTCG GTCGTTATTC CTTGGACGCT ATACGGTTTT
GGGATTCCGA TTCTCTGCCT GTGGCTGAGT TCACGCTGGT TACCTGCGCG TCGCGGTGAG
CAGACCGCCA ATTGGTTAGC GGCCAATGCT ATTTATCTGT TGGCCTTGTG GATCAACGTT
GAATTACGTC ACCTGCTGCA TGGCAGCTAT CATCTGTCGC TGGATTTCGC TTCTCTTACC
GATTGTGCGT TGCATGGGGT GACCTTCGGG CTGATGGCGC TGGCCTACGG CTATCGTGAA
CAGTTTGCAG ATTCGCTACA ACGGATATAC CGCCTGGCGG CTAAAACCAG TGCCATTCTG
ATGCTCACCC TAACGTTGCT CTGCCTGGTT GTCTTTAACC CCATCTGGTC AGCACAACAG
GTTGGCGGCC TACCGCTGCT GAATATGGTG ACCGTTGCCT ATGCGATCCC GATGGTGTTC
ATGGTGGTAG CACTAAGATA CTGGCCAGAT CACTGGCTAG ATATCCGTCT GAAGCCCTAC
GTTATCGGTG CGATTGCGCT ACTGGGAATG GTGTTCATTA CCTTGACGGT CCGCCAGCTC
TGGCATGGTG ACCGGTTGGA TAATGATATC GTGTACAGCG GCGAACAATA CTCTTACTCC
TTAGCGTGGA TGTTAACTGC CATTGGCATG ATGTACAGCG CAATACACTG GCCTAACTAT
AAAGTGCGCC GCGCCTCCCT GGGGTTGTTG GCGCTCACGA TCGTCAAACT GTTCCTGTGG
GATATGTCTG GTCTTGAAGG GCTATATCGT TCGGTCTCAT TCCTCGGTCT GGGGCTGTGT
CTGGTAGCGA TTGGTTGGTT CTACCAGAAG TTTGTCGTTA TGAAGGAAGT CATTGAGCAA
AACAAGGATC AGCCGACAGC GGACGTGGTA CAAAGCGGGC AATAA
 
Protein sequence
MEYWLLIFLL IFLFVIYMQI SALSNSVKYI REEIGRLRNE LAPRLPEPTV ATVEEENTPT 
HLADEVAVSK VWTPDYATYQ NYLLTKKQSQ IKEKALADSV KKEPARPIEY RIADRLVVLK
PQDEAPQESP KMDAGPASGH PQKPDVWTKL ENVLFSLKSP ESFWKRLEQE FAARWMVWIG
GVIMALGVIF LLKAGSEQGL FGPTLRIGTA IVLSVLMVVG GEWLRRSRFQ LPLVNAGYIP
AALSGAGILG LFASMLAAKY LYDMFPMPVL MLLLSLISLL AMVLALWQGP FMAALGLLGA
YTVPLFVSTG SGNVTGLLAY LFLVSLASIG LMSRVYRHWL WLGAMVGNYL WLLVSLFIAT
SEHQLARSLF LLATTYGFLA WPHLGWALKS KIRFRHNDWK NWPPALLDPL LTGLIAGLSL
LALTLSANHS ILVWSTLVFS CISLLLIARR SPSLDLFVPL AAILAISPFI NAGTLFEPET
VWTSLLALST LALFCGIYGS YQVTRPVYRK LWWALLSVLV PMYLVVSTYY ACRDNFELAP
VKYTLLAVCL AMYIAWNAFS DGYRRGSALL RTIYQVAAQL TLTFALFMAF SQVTLTLLLA
GQLLVLVLWQ RWRQVSLLSW ILKLHAIILL VRLSMNPYIL GYSSPLHIGS VVIPWTLYGF
GIPILCLWLS SRWLPARRGE QTANWLAANA IYLLALWINV ELRHLLHGSY HLSLDFASLT
DCALHGVTFG LMALAYGYRE QFADSLQRIY RLAAKTSAIL MLTLTLLCLV VFNPIWSAQQ
VGGLPLLNMV TVAYAIPMVF MVVALRYWPD HWLDIRLKPY VIGAIALLGM VFITLTVRQL
WHGDRLDNDI VYSGEQYSYS LAWMLTAIGM MYSAIHWPNY KVRRASLGLL ALTIVKLFLW
DMSGLEGLYR SVSFLGLGLC LVAIGWFYQK FVVMKEVIEQ NKDQPTADVV QSGQ