Gene TM1040_3616 details


Gene Information

Locus tag: TM1040_3616
Symbol:
ID: 4075043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 670359
End bp: 671495
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 59%
IMG OID: 638005135
Product: periplasmic solute binding protein
Protein accession: YP_611845
Protein GI: 99078587
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4531] ABC-type Zn2+ transport system, periplasmic component/surface adhesin
TIGRFAM ID:
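
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 670359
end_bp = 671495
gene_length_bp = 1137
protein_length_aa = 378

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)

print(span_bp == gene_length_bp)
print(remainder == 0 and codons - 1 == protein_length_aa)
```

Both checks print True for the values listed in this record.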


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0144381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCAGAA CAGTTTTGTC CGCGCTCTTT CTCAGCCTAT CTGTTGTTCC TGCCCTTGCA 
GAGACCCCGC GTGTCGTGAC CGACATTGCA CCCGTGCAGG GTCTGGTCGC TCGGGTCATG
GATGGCGTCG GCGCGCCAGA TGTTCTGGTT CCGCCCGGAG CGTCCCCCCA TGGGCATAGC
CTGAAGCCAT CGGATGCCCG CGCGTTGACT TCTGCGGATG CGGTGTTCTG GATCGGTGAC
GAATTATCGC CGTGGCTGCT CGGCTCGCTC AAAGAGCTCG CAGGGGATGC GCATGTGGTG
TCCCTACTCG CGGCACCGCA GACGATGCGG CTTGAGTTCC GCGAGGGGGT GGTTTTTGGT
GGGGCTGACC ACGATGACCA TGGCCACGAT GATCACGACC ACGATGCCCA CGAAGATCAT
GCTCACGACG GTCACGGGCA CGAAGAGCAC GATCATGATG ACCACAAGGG CCATGATGAT
CACGGTGCAC ACGACCATGA CGACCATGCA CATGATCAAG ACGCGCATGG TCACGATGAG
GATGCGCATG ACGCCCACGA TCACGACTCG CATGAGACCG CTCACGATGA CCATGGTCAT
GGTCATGACG ATCACGCTCA CGACGGGGTT GATCCGCATG CCTGGCTGGC ACCTGAAAAC
GGCAAGCAGT GGCTGGCCTT GGTTGCCGAT GAGCTGTCCG AGATCGATCC GGCGAATGCG
GACACCTATC AGAACAACGC GCGTGCGGGC CAAGCCGAAA TCGACGCAAT TGTTGCTGCC
ACAAAGGCAG ATCTCGGCGA AGCCCATGGG CAGTTCGTGG TATTCCACGA TGCATATCAG
TACTTTGAGC AAAGCTTCGG GCTTCGTGCT CTCGGTGCAA TTGCTCTTGG AGATGCTTCC
GACCCGAGCG TCGCGCGGAT CGCAGAAATG CGCGATGCGG TTGCTGGTCA AGAGGTCTCC
TGTGTGTTCT CTGAGCCGCA ATTCAATGCA GGTCTTGTAG ACACCGTCGC TGATGGGCTC
GACATTAAGG CCGTTGTGAT CGACCCGCTG GGGACCGAAA TCGCAACTGG GCCGTCGTTC
TATACAGATC TGCTGTCCGA GATTTCTGCA GGCTTCAAAA CGTGCCTGAC GCACTGA
 
Protein sequence
MPRTVLSALF LSLSVVPALA ETPRVVTDIA PVQGLVARVM DGVGAPDVLV PPGASPHGHS 
LKPSDARALT SADAVFWIGD ELSPWLLGSL KELAGDAHVV SLLAAPQTMR LEFREGVVFG
GADHDDHGHD DHDHDAHEDH AHDGHGHEEH DHDDHKGHDD HGAHDHDDHA HDQDAHGHDE
DAHDAHDHDS HETAHDDHGH GHDDHAHDGV DPHAWLAPEN GKQWLALVAD ELSEIDPANA
DTYQNNARAG QAEIDAIVAA TKADLGEAHG QFVVFHDAYQ YFEQSFGLRA LGAIALGDAS
DPSVARIAEM RDAVAGQEVS CVFSEPQFNA GLVDTVADGL DIKAVVIDPL GTEIATGPSF
YTDLLSEISA GFKTCLTH
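
The gene sequence begins with GTG, which normally encodes valine, while the protein sequence begins with M. Translation table 11 uses the standard codon assignments but permits alternative initiation codons such as GTG, and the initiator is read as methionine. The following is a minimal sketch of that relationship, not the database's own pipeline. The function names are hypothetical, and it assumes the input is a complete coding sequence that starts at its initiation codon.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-residue display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if residues:
        # Assumption: the first codon is the initiator, read as methionine
        # even when it is an alternative start codon such as GTG.
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence above.
first_blocks = "GTGCCCAGAA CAGTTTTGTC CGCGCTCTTT"
print(translate_cds(first_blocks))  # MPRTVLSALF
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.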