Gene TM1040_3603 details


Gene Information

Locus tag: TM1040_3603
Symbol:
ID: 4075030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 652050
End bp: 653573
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 56%
IMG OID: 638005122
Product: extracellular solute-binding protein
Protein accession: YP_611832
Protein GI: 99078574
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
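The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the Gene Information fields above.
start_bp = 652050
end_bp = 653573

# Assumption: both endpoints belong to the gene, so the span is inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1524
print(protein_length_aa)  # 507
```

Both values agree with the Gene Length and Protein Length fields of the record.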


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.721238
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGATC TTTCTCTCCC CCGTTCGGCT TTGATTGCCC TGACGCTTGC TTCGACCACC 
GCGATGCCCG CGCTTGCAGA GAAAGCTGCA GGCACCTTGA ACGTCGCCTT CACCAAAGAG
CTCGAGAACG TCGACAGCTA TTTCAATTCA TCGCGCGAAG GCGTGGTGAT GCAGCGCGCG
GTCTGGGATG GCCTGATCTA CCGCGATCCC AACACCAACG AATACATCGG CAACCTTGCA
ACCAGCTGGG AATGGATCGA CGACACCACG CTGGAATTCA AGCTGCGTGA GGGCGTTACT
TTTCACAATG GCGAGCCCTT CAACGCCGAT GACGTCGTCT ACACCGTGAA CTATGTCGCC
AACGAGGAAA ATGGCGTCAA AACCCAGCGC AATGTGAACT GGATGAAGTC CGCAGAAAAG
ATCGACGACT ACACCGTCCG GATCCACCTC AAGGACAAAT TTCCCGCTGC GATCGAGTTC
CTCTCCGGCC CAGTTTCGAT GTATCCCAAT GAGTATTACG CCGAGGCAGG CCCCTCTGGC
ATGGGACTGA AGCCCATCGG CACCGGGCCT TACAAGGTGA CAGAAGTGGT TCCGGGCCAG
CATTTTGTGC TTGAGGCCAA CGAGACCTAT CACGACAGCC CCAAGGGTCA GCCGGAGATC
GCAAAAATCG ACATCCGCAC CATTCCAGAC GTCAATACTC AGATGGCAGA GCTCTTTTCC
GGCTCTCTGG ATCTGATCTG GCAGGTGCCC GCGGATCAGG CCGAAAAACT TGCGCAACTG
GGCCAGTTCA CCGTCGCCAA TGAATCCACG ATGCGTGTGG GCTACCTGCA AATGGACTCG
GCCGGTCGAT CAGGTGAGGA CAACCCGTTT ACCAACGCCA AGGTGCGCGA GGCCGTGAAC
TATGCGATCA ACCGTCAGGA ACTGGTCGAT GCCCTGCTCA AGGGCTCCAG CCAAGTTGTC
TACACCCCCT GTTTTCCAAG CCAGTTTGGC TGTGTGCAGG ATGTGACCAC GTATGAGTAC
AATCCCGAAA AGGCGAAGGA GCTGCTGGCA GAGGCAGGTT ATCCCGACGG GTTCTCGACA
GAATTCTATG CCTATCGCGA CCGCCAGTAT GCAGAGGCCA TCGTCTCTTA CCTGAATGCC
GTGGGCATCG ATACCGATTT CAAGATGCTA CAGTATTCAG CGCTGCGCGA CCTGAACATG
AAGGGCGAAG TGCCGCTGTC GTTCCAAACC TGGGGCAGCT ATTCGATCAA TGACGCGTCT
GCGATGGTTA GCCAGTTCTT CAAACACGGC TCGCTCGACA GCACCCGCGA CGATGAAGTG
CTCGATTGGC TAAATGTGGC CGACAGCTCC ACCGATCCCG ACGAGCGGAT CGAGTATTAC
ACCAAGGCGA TCCAGAAGAT CACAGGCGAG GCCTACTGGG CACCCATGTT CAGCTACAAC
ACGAACTATG TCTTCACCAG CGACGTGAGC TACACGCCCA CCGCAGACGA AGTTCTGCGT
TTTGTGGACA TGTCCTGGAA CTGA
 
Protein sequence
MFDLSLPRSA LIALTLASTT AMPALAEKAA GTLNVAFTKE LENVDSYFNS SREGVVMQRA 
VWDGLIYRDP NTNEYIGNLA TSWEWIDDTT LEFKLREGVT FHNGEPFNAD DVVYTVNYVA
NEENGVKTQR NVNWMKSAEK IDDYTVRIHL KDKFPAAIEF LSGPVSMYPN EYYAEAGPSG
MGLKPIGTGP YKVTEVVPGQ HFVLEANETY HDSPKGQPEI AKIDIRTIPD VNTQMAELFS
GSLDLIWQVP ADQAEKLAQL GQFTVANEST MRVGYLQMDS AGRSGEDNPF TNAKVREAVN
YAINRQELVD ALLKGSSQVV YTPCFPSQFG CVQDVTTYEY NPEKAKELLA EAGYPDGFST
EFYAYRDRQY AEAIVSYLNA VGIDTDFKML QYSALRDLNM KGEVPLSFQT WGSYSINDAS
AMVSQFFKHG SLDSTRDDEV LDWLNVADSS TDPDERIEYY TKAIQKITGE AYWAPMFSYN
TNYVFTSDVS YTPTADEVLR FVDMSWN
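The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the sense-codon assignments of translation table 11, which are the same as those of the standard genetic code; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. The sketch only handles the unambiguous bases A, C, G, and T.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 24 bases of the gene sequence listed above.
first_block = "ATGTTTGATC TTTCTCTCCC CCGTTCGGCT"
last_block = "TTTGTGGACA TGTCCTGGAA CTGA"

print(translate(first_block))  # MFDLSLPRSA
print(translate(last_block))   # FVDMSWN
```

The two fragments reproduce the first ten and last seven residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the listed GC content to be compared in the same way.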