Gene TM1040_0835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0835 
Symbol 
ID4077541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp879761 
End bp880786 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content56% 
IMG OID638006133 
Productextracellular solute-binding protein 
Protein accessionYP_612830 
Protein GI99080676 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.632681 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.843315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATC TGATTGCAGC CACAACGCTT GCGACCGTCT CCGCGACGGC GGCCATGGCG 
GAGGGGCAGT TGAATATCTA CAACTGGGGC AACTACACCA GCCCGGAATT GATCGAGAAG
TTCGAACAGG AGTTCGACAT CGACGTCACC ATCACCGACT ATGACAGCAA CGACACCGCA
CTGGCCAAGA TCAAGGCAGG TGGGCATGGC TTTGACATCG TGGTGCCGTC TGGCACCTAT
GTGCCGATCT TCATCGGCGA AGGCCTGTTG ATGAAGTCGA TGCCCAACCA GATGGAGAAC
TTCAAGAACA TGGACCCGCG CTGGGTCGAT GTGGATTTTG ATCCTGGCCG CGACTACACC
GTGCCTTGGC AATGGGGCAC CGTGGGCGTC ACCGTCAATA CTTCGGTTTA TTCAGGCGAC
ATCAACTCGG CGGCACTGAT CTTTGATCCG CCAGAAGAGC TGAAGGGCAA GATCAACGTC
GTTCCAGAGA TGCTCGACGT GATGGGCATG GCCATTCACT ACATGGGCGG AGAGCAATGC
ACCGCCGACA AGGACATGCT GGCCAAAGTG CGCGATAAAC TGGTCGAGGC CAAAAAGGAC
TGGCTCTCCA TGGCCTATGG CAACATCGAG AAGTTCGCCA AGGGCGACCT CGCGGCTGGG
GTCAATTGGA ACGGCGCCTC ATTCCGGGCA CGTCTGCAAA ACGATGACAT CGCCTTTGGC
TATCCACAGA CCGGGTTTTC GATCTGGATG GACAACGCCG CGATCCTCGC GGATGCGCAG
AATGTCGACA ATGCCAAACT GTTCCTGAAC TATATCATGG CTCCGGAGAA CGCAGCGCTT
CTGTCCAATT TTGCCCGCTA CGCCAATGGC ATCAAAGGAT CTGAACCCTT TATGGATGCG
GCCATGGCAG AGGCCTCCGA GGTGGTTATT CCCGACGAGC TCAAAGATGC CGGCTATCTC
GCCAAGACCT GCCCACCCGA CGTGCAGCGG ATCTATTCCA AGATCTGGAC CGAAGTGACC
AAATAA
 
Protein sequence
MRHLIAATTL ATVSATAAMA EGQLNIYNWG NYTSPELIEK FEQEFDIDVT ITDYDSNDTA 
LAKIKAGGHG FDIVVPSGTY VPIFIGEGLL MKSMPNQMEN FKNMDPRWVD VDFDPGRDYT
VPWQWGTVGV TVNTSVYSGD INSAALIFDP PEELKGKINV VPEMLDVMGM AIHYMGGEQC
TADKDMLAKV RDKLVEAKKD WLSMAYGNIE KFAKGDLAAG VNWNGASFRA RLQNDDIAFG
YPQTGFSIWM DNAAILADAQ NVDNAKLFLN YIMAPENAAL LSNFARYANG IKGSEPFMDA
AMAEASEVVI PDELKDAGYL AKTCPPDVQR IYSKIWTEVT K