Gene TM1040_3178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3178 
Symbol 
ID4075348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp161441 
End bp162601 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content60% 
IMG OID638004681 
Productextracellular ligand-binding receptor 
Protein accessionYP_611414 
Protein GI99078156 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.677267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.56284 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCTCA AGAAAACGAT GCTTGCCGCT GCTGCGGCTT TGGCATTTCC CCTCATTGCG 
TCCGCCGAGC AAGGCGTGAC TGGCGACAGT GTCACATTTG CACAGGTTGC CGCCTTTGAT
GGGCCAGCCG CTGCACTCGG CACCGGCATG CGCCTTGGCA TTACCGCAGC CTTTGAGGAA
GCAAACGCCG CAGGTGGTGT GCACGGGCGG ATGCTGAAAC TCGACAGTAT GGATGACGGC
TACGAGCCCG ACCGCTCTGC CGCTTTGGTC AAGACCGTGA TCGAAGGCAA TGGCCATATT
GGTCTGATTG GCGCGGTGGG CACCCCGACC TCCTCTGCGA CGCAGCCCAT CGCTACCGAG
GCCAATGTTC CCTTCATCGG CCCCTTCACC GGCGCGGGCT TCTTGCGCGA CGCCTCTCAT
GGCAACATCT ACAATGTGCG CGCCAGCTAT TTTGCGGAAA CCGAAGCCTG GATCGAATAT
CTCGTCGATC AGCAAGGCAT GAAGTCGATC GCGATCCTCT ATCAGGACGA CGGCTTTGGC
CGCGTGGGGC TGAACGGCGT CACCGCTGCG CTTGAAAAAC GCGGCATGAG CCTCGCGGCA
GAAGGCACAT ATACCCGCAA CACCACCGCC GTCAAAAAGG CGCTGCTGGC GATCCGCAAG
GCGAAGCCCG ATGCGGTGGT CATGGTCGGC GCCTATAAAC CGGTGGCCGA ATTCATCAAA
CTCGCGCGCA AAATGAAGCT CGACTCGGAG TTCGTGAATA TCTCCTTTGT CGGCTCTGAC
GCTCTGGCAC AGGAATTGGG CGAGCATGGC GAAGGTGTGA TCATCAGCCA GGTGGTGCCC
TTCCCGTGGG ACATGTCGAT CCCGGTTGTC GCGCAATATA CCGAAGCCCT GAAGGCCGTG
GATGCCGCCG CCAAGCCCGG CTTTGTGTCG CTTGAAGGCT ATATCGTCGG TCGTCTCGCC
ATTGCCGGTC TCGAAGCCGC AGGCAAGGAG CTGACCCGTG ACTCCTATCT TGCCGCTCTG
GCAGGACTCT CCACGGTCGA TCTCGGCGGT GTCAGCATGG TCTTTGGTGC GGACGACAAC
CAGGGCATGG ATGACGTGTT CCTGACCCGT ATCACGGCAG ACGGCCAGTT CGAGCCCATC
GTATCCGGCG GCGGCTCCTA A
 
Protein sequence
MFLKKTMLAA AAALAFPLIA SAEQGVTGDS VTFAQVAAFD GPAAALGTGM RLGITAAFEE 
ANAAGGVHGR MLKLDSMDDG YEPDRSAALV KTVIEGNGHI GLIGAVGTPT SSATQPIATE
ANVPFIGPFT GAGFLRDASH GNIYNVRASY FAETEAWIEY LVDQQGMKSI AILYQDDGFG
RVGLNGVTAA LEKRGMSLAA EGTYTRNTTA VKKALLAIRK AKPDAVVMVG AYKPVAEFIK
LARKMKLDSE FVNISFVGSD ALAQELGEHG EGVIISQVVP FPWDMSIPVV AQYTEALKAV
DAAAKPGFVS LEGYIVGRLA IAGLEAAGKE LTRDSYLAAL AGLSTVDLGG VSMVFGADDN
QGMDDVFLTR ITADGQFEPI VSGGGS