Gene TM1040_3383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3383 
Symbol 
ID4075282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp398618 
End bp400162 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content57% 
IMG OID638004891 
ProductBCCT transporter 
Protein accessionYP_611617 
Protein GI99078359 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.198067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGACA CATCTTTGGA AACACATCTC AAGAAGAGCC GGCTCAACAC GCCGCTTTTC 
CTGATTTCGG GGGGCTTTAT CGCGCTCTTT TGTATCGCCG CATTGATCGA CCTTGAGGGG
TTGTCGGCGG CTGTGGATTG GGGCTTCAAC GTCTCGGCCA CCTATTTCGG CCTCTACTGG
CAGGTTCTCC TTCTGGCGAC CTTCCTGATC GGCCTCGTGC TCTGCGTTCT ACCAGGTGGC
AAGGCAATCA TGGGCAATCT GGCCGCGCCG GAATTCACCC TGTTTCAGTG GGGCTCCATG
ATCATGTGTA CCCTGCTTGC CGGCGGGGGT GTATTCTGGG CTGCGGGGGA GCCAATCGCA
CATTTCCTTT ATTCACCACC GCTCTATGGA GCTGAAGGTG GGACCGAGGC GGCCGTCAAT
CCCGCAATTG CGCAGAGCTT CATGCATTGG GGCTTTCTTG CTTGGGCCAT TCTTGGTTCG
CTCTCGACCG TGATGTTGAT GCATTATCAC TACGAAAAGG GCCTGCCGCT TGCTCCGCGC
ACGCTGCTCT ACCCGCTGTT TGGTGACAAG GCGATCAACG GGCCAATCGG TCTGATCGCA
GATGCGTCCT CGATCATTGC CGTTGTTGCT GGAACCGTGG GGCCAATCGG TTTTCTGGGC
CTTCAGGTCA GCTATGGCCT GTCTGACCTC TTTGGTCTCC CGGACGTCTT TGCCACGCAG
TTCATGGTGA TCGGTGGGCT TGTCGCGATC TATACGATCT CGGCGATGAC TGGTTTGTCG
CGCGGGATTC AGCTTCTGTC CAAGATCAAT GTGATCCTGG CGGCGGCGCT CTTGATTTTC
GTGCTGGTAG CTGGACCGAC GGGATTCATC TTTGGCTCCT TCTTCTCTGG CTTTGGCACA
TATCTTGGCA ACTTTTTCCA AATGGCCCTG TTCCGCGGTG ACGCAGGCGT CTTTGGCGAA
CCGGGCTGGC TGGGCTGGTG GACCGTGTTC TTTTGGGGAT GGTTCATGGG CTATGGCCCG
CTGATGGCGA TGTTCATTGC CCGTGTCTCG CGGGGCCGCT CGATCCGTTC GATCATCATC
ATGCTATCGA TCATCGCCCC GATCGTCACC AACTTCTGGT TTACCATCAT CGGTGGCACC
GGCATCTCGA TGGAGCTGGC CGCGCCGGGG ACCATTTCCA CCGCGTTCGA AGGGTTCAAC
CTGCCCGCAG CACTATTGGC GATCACCTCC AACCTGCCGA TGGGCTTTTT GATCTCGCTT
CTGTTCCTGA TCCTGACAAC GATCTTTGTG GCCACAACCG GCGACAGCAT GAGCTACGTG
ATCTCAGCCA CGATGAGCGA CGGGGAAAAT CCGCCCACAA TGGTGCGCAT TTTCTGGGGC
GTTGCGATGG GTATCATGGC TATCATCCTG ATCTCGGTCG GCTCCGGGGG CGTGTCCAAG
CTGCAGAGTT TCATAGTCGT GACCGCCGTG CCAGTCTCGC TGATTCTGCT TCCATCGCTC
TGGGACGCAT TGCGCATCAC AATTGCCAAA GGCCGCGAGC AATAA
 
Protein sequence
MSDTSLETHL KKSRLNTPLF LISGGFIALF CIAALIDLEG LSAAVDWGFN VSATYFGLYW 
QVLLLATFLI GLVLCVLPGG KAIMGNLAAP EFTLFQWGSM IMCTLLAGGG VFWAAGEPIA
HFLYSPPLYG AEGGTEAAVN PAIAQSFMHW GFLAWAILGS LSTVMLMHYH YEKGLPLAPR
TLLYPLFGDK AINGPIGLIA DASSIIAVVA GTVGPIGFLG LQVSYGLSDL FGLPDVFATQ
FMVIGGLVAI YTISAMTGLS RGIQLLSKIN VILAAALLIF VLVAGPTGFI FGSFFSGFGT
YLGNFFQMAL FRGDAGVFGE PGWLGWWTVF FWGWFMGYGP LMAMFIARVS RGRSIRSIII
MLSIIAPIVT NFWFTIIGGT GISMELAAPG TISTAFEGFN LPAALLAITS NLPMGFLISL
LFLILTTIFV ATTGDSMSYV ISATMSDGEN PPTMVRIFWG VAMGIMAIIL ISVGSGGVSK
LQSFIVVTAV PVSLILLPSL WDALRITIAK GREQ