Gene TM1040_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3333 
Symbol 
ID4075232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp342686 
End bp343723 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content58% 
IMG OID638004841 
Productthiosulphate-binding protein 
Protein accessionYP_611567 
Protein GI99078309 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4150] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.885845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.45738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTCT CCTCCAAACC GTTTCTCCGG TCCTCTCGGG TTCTGGTTCG CAGCCTCGGT 
GCTGCCGCGA TCGCGCTTGC TGGTTTTGCG AGTTCGGCCG AAGCCGAAGA GCAGGAAATC
CTCAATGTTT CCTACGATAT TGCCCGTGAA CTCTACGCGG CGCTGAACCC TGTTTTTGCC
GAAAACTGGC AGGCAGAGAC GGGCGAGACA CTGACCATCA AACAAAGCCA TGCTGGCTCT
TCCAAACAGG CTCGAGCGAT CCTGCAAGGT CTGCAGGCGG ATCTCGTGAC CTTCAATCAG
GTGCTGGATG TGCAGATCCT CGCAGACAAG GGTTTTGTGG CGCAGGACTG GCAGCAAAAG
CTGCCCAATA ACGCATCGCC TTACTACTCG CTCCCGGCTT TCCTGGTGCG GGGTGGCAAC
CCCAAGGGTA TTGAAGACTG GGACGATCTG ACCCGTGATG ACGTGGAACT CGTGTTCCCG
AACCCAAAAA CCAGCGGCAA TGCGCGCTAC ACCTATCTCG CGGCCTACGC CTATGCGCTT
GACAAGTTTG GAGGCGATGA GGCCGCGGCG CAGGAATTTG TCGGTAAGAT CCTCTCCAAT
GTCGTGGTGT TCGACACCGG TGGGCGTGGT GCGACAACGA GCTTTGTCGA GCGTGAGCTT
GGCGATGTGC TGATTACCTT CGAGGCCGAG GTCGAGAACA TCCGCGCCAG TGAGGATGAA
GGTGCCTTTG ATCGTGTGGT GCCTGCAATC TCCCTCTTGG CAGAGTTCCC TGTGGCGCTG
GTTGACAAGG TGGCAGATGC ACGGGGCAGC CGAGCCGTTG GCGAGGCCTA TCTCGACTTT
CTCTACTCCA AGGACGCGCA GGAAGTCATT GCCGGTTTCA ACAACCGTGT GCATCACCCC
GAGGTGGTGG CTGCAACAGC CGACAAGTTC CCTGATGTGC GTCTGATCAC GGTCGAAGAA
GTCTTTGGCA GCTGGGCCGA AGCGCAGGAG ACCCACTTTG GCGAGGGTGG TACGCTCGAC
CGGGTCTTCA CCAACTAA
 
Protein sequence
MPLSSKPFLR SSRVLVRSLG AAAIALAGFA SSAEAEEQEI LNVSYDIARE LYAALNPVFA 
ENWQAETGET LTIKQSHAGS SKQARAILQG LQADLVTFNQ VLDVQILADK GFVAQDWQQK
LPNNASPYYS LPAFLVRGGN PKGIEDWDDL TRDDVELVFP NPKTSGNARY TYLAAYAYAL
DKFGGDEAAA QEFVGKILSN VVVFDTGGRG ATTSFVEREL GDVLITFEAE VENIRASEDE
GAFDRVVPAI SLLAEFPVAL VDKVADARGS RAVGEAYLDF LYSKDAQEVI AGFNNRVHHP
EVVAATADKF PDVRLITVEE VFGSWAEAQE THFGEGGTLD RVFTN