Gene YpsIP31758_3627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3627 
Symbol 
ID5386876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4093756 
End bp4095087 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content46% 
IMG OID640866647 
ProductCBS/transporter associated domain-containing protein 
Protein accessionYP_001402581 
Protein GI153948297 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones63 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAATA GTATCTTACT GATTCTTTTT TTAATTGCGG TCAGCGCCTT CTTCTCGCTA 
TCAGAGATTT CATTGGCGGC TTCACGCAAA ATTAAACTAA AACTGCTGGC GGACGAGGGC
GATACCAACG CCTTACGAGT CCTGAAACTG CAAGAGACGC CAGGAATGTT CTTCACCGTG
GTCCAAATTG GCCTGAATGC TGTCGCCATT CTTGGTGGTA TTGTCGGTGA TGCCGCTTTC
TCCCCTTCGT TCAAACTCGT TTTTGAGCGT TTTATGGCTC CTGAGTTGGC CGATCAAGCC
AGTTTCGTTT GTTCTTTCGT GTTAGTGACC AGCTTATTTA TTCTGTTTGC TGATTTAACC
CCGAAACGCA TCGGTATGAT TTCACCTGAA GCGGTTGCCG TCCGGATCGT CAACCCAATG
CGCTTCTGCC TAATGATCTT CCGCCCATTA GTCTGGTTCT TCAATGGGAT GGCAAATCTT
ATCTTCCGCC TATTTAAATT ACCCATGGTC CGTAACGATG ACATCACTTC CGATGATATC
TATGCCGTGG TAGAAGCCGG TGCGCTCGCC GGAGTGCTAC GCAAGCAAGA GCATGAGTTG
ATTGAAAACG TCTTTGAGCT GGAGTCTCGA ACCGTTCCTT CCTCCATGAC TTCACGTGAA
AACGTGATTT ACTTTGATCT ACGGGAAAGC GAAGACAGTA TCAAAGATAA AATCTCCACA
CATCCGCACT CAAAATTCCT GGTATGTGAT GGTCACATTG ACCAAGTGGT GGGTTACGTT
GACTCTAAAG ACTTGCTGAA TCGGGTATTA GGTAACCAAA GTCTGGTACT CAGCAGTGGC
GTACAAATTC GTTCAGCTCT GATTGTGCCA GATACATTGA CACTTTCAGA AGCGTTGGAG
AGTTTTAAAA CCGCAGGTGA AGACTTCGCC GTGATCCTCA ACGAATATGC TTTAGTTGTT
GGGATAATTA CACTGAATGA CGTAATGACC ACGTTGATGG GCGATTTAGT TGGCCAAGGG
CAGGAAGAGC AAATTGTTGC CCGCGATGAG AATTCATGGC TGATTGAGGG CGGTACACCA
ATTGAAGATG TCATGCGCGT ACTGCATATC GACGATTTCC CGCAATCGGG CAATTATGAA
ACTATCGGCG GCTTTATGAT GTATATGCTG CGTAAAATTC CTAAACGAAC TGATTTTGTT
AAATATGCGG GTTACAAATT TGAAGTCGTC GATATTGATA GCTACAAGAT AGACCAGCTA
CTGGTGACAA GGCTCAGTGA CCAGCCAGCG CCAATCCTGC CAAAAGCACC ACACGAAAGC
AGTGACGCCT AG
 
Protein sequence
MLNSILLILF LIAVSAFFSL SEISLAASRK IKLKLLADEG DTNALRVLKL QETPGMFFTV 
VQIGLNAVAI LGGIVGDAAF SPSFKLVFER FMAPELADQA SFVCSFVLVT SLFILFADLT
PKRIGMISPE AVAVRIVNPM RFCLMIFRPL VWFFNGMANL IFRLFKLPMV RNDDITSDDI
YAVVEAGALA GVLRKQEHEL IENVFELESR TVPSSMTSRE NVIYFDLRES EDSIKDKIST
HPHSKFLVCD GHIDQVVGYV DSKDLLNRVL GNQSLVLSSG VQIRSALIVP DTLTLSEALE
SFKTAGEDFA VILNEYALVV GIITLNDVMT TLMGDLVGQG QEEQIVARDE NSWLIEGGTP
IEDVMRVLHI DDFPQSGNYE TIGGFMMYML RKIPKRTDFV KYAGYKFEVV DIDSYKIDQL
LVTRLSDQPA PILPKAPHES SDA