Gene TM1040_3332 details


Gene Information

Locus tag: TM1040_3332
Symbol:
ID: 4075231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 341777
End bp: 342667
Gene Length: 891 bp
Protein Length: 296 aa
Translation table: 11
GC content: 60%
IMG OID: 638004840
Product: Sulfate ABC transporter, permease protein CysT
Protein accession: YP_611566
Protein GI: 99078308
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0555] ABC-type sulfate transport system, permease component
TIGRFAM ID: [TIGR00969] sulfate ABC transporter, permease protein
            [TIGR02139] sulfate ABC transporter, permease protein CysT
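
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither assumption is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 341777
end_bp = 342667

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is one open reading frame ending in one stop codon,
# so the protein has one residue per codon except the terminal stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 891 0 296
```

Under these assumptions the computed values agree with the reported gene length (891 bp) and protein length (296 aa).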


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.845298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.297479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGCC CCGAGGGGGC CGGACGAAAC CCATACCGGA GTATCACCGT GGTGTCCCTG 
ACCCGAACCG CCCCGCGCGT GCTGCCGGGG TTTTCCCTTT CTCTTGGCGT GACACTATTG
TTCGTGACGC TGATCATCCT ATTGCCCCTA AGCGGGTTGA TGTGGCAACT CGCTCAACTG
AGCCCGGCTG ACTATATCAA GGTGATGACA TCTCGCCGCG TGCTTGTTGC GCTCAAGGTG
ACACTGTCTG CGGCGGCTCT TGCAACGCTG ATCAATGCCG TCTTTGGTCT GCTTCTGGCT
TGGGTCCTGG TGCGCTATCG CTTTTGGGGG CGCGGAGTCC TGGACGCGCT TGTCGACATT
CCCTTCGCGC TGCCGACTGC TGTGGCGGGG ATCGCATTGG TGGCGCTCTA CGACAAATCC
GGCTGGATCG GCGGGATGCT CGCGGAGTTT GACATCAAGA TCGCCTATAC TTGGTGGGGC
ATTGTAATCG CGATGGTGTT CACCTCGGTG CCCTTTGCGG TGCGCGCGAT ACAGCCCGCG
ATCGAGGAAC TGGACCCCGA TGAGGAAGCC GCGGCGCTCA CGCTTGGGGC CAGTGGTCTA
CAAAGGTTTG TGCGGGTGAT CCTGCGTCCA TTGCTGCCTG CGATCCTGAC CGGAGTTGCG
CTCTCATTTG TGCGCTCGCT CGGAGAGTTC GGCGCGGTCA TCTTCATCGC CGGGAACCTC
CCCTTCAAGA CCGAGATCGC CTCGCTTCTG ATCCTGATCC GACTTGATGA GTTCGACTAT
CCGGCGGCGG CGGCGATTGC CGGTAGCCTT CTGGGGCTAT CGCTTTTGTT GTTGATCGTG
GTCAACCTGG TGCAAACCCG GCTCTATCGC TACCTGCGGA CGGAAGGGTA G
 
Protein sequence
MFGPEGAGRN PYRSITVVSL TRTAPRVLPG FSLSLGVTLL FVTLIILLPL SGLMWQLAQL 
SPADYIKVMT SRRVLVALKV TLSAAALATL INAVFGLLLA WVLVRYRFWG RGVLDALVDI
PFALPTAVAG IALVALYDKS GWIGGMLAEF DIKIAYTWWG IVIAMVFTSV PFAVRAIQPA
IEELDPDEEA AALTLGASGL QRFVRVILRP LLPAILTGVA LSFVRSLGEF GAVIFIAGNL
PFKTEIASLL ILIRLDEFDY PAAAAIAGSL LGLSLLLLIV VNLVQTRLYR YLRTEG
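
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library. It is not the tool that produced this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTTCGGCC CCGAGGGGGC CGGACGAAAC CCATACCGGA GTATCACCGT GGTGTCCCTG"
last_line = "GTCAACCTGG TGCAAACCCG GCTCTATCGC TACCTGCGGA CGGAAGGGTA G"

print(translate(first_line))  # MFGPEGAGRNPYRSITVVSL
print(translate(last_line))   # VNLVQTRLYRYLRTEG
```

The two printed fragments match the first 20 and last 16 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.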