Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3178 |
Symbol | |
ID | 4075348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 161441 |
End bp | 162601 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004681 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_611414 |
Protein GI | 99078156 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.677267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.56284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCTCA AGAAAACGAT GCTTGCCGCT GCTGCGGCTT TGGCATTTCC CCTCATTGCG TCCGCCGAGC AAGGCGTGAC TGGCGACAGT GTCACATTTG CACAGGTTGC CGCCTTTGAT GGGCCAGCCG CTGCACTCGG CACCGGCATG CGCCTTGGCA TTACCGCAGC CTTTGAGGAA GCAAACGCCG CAGGTGGTGT GCACGGGCGG ATGCTGAAAC TCGACAGTAT GGATGACGGC TACGAGCCCG ACCGCTCTGC CGCTTTGGTC AAGACCGTGA TCGAAGGCAA TGGCCATATT GGTCTGATTG GCGCGGTGGG CACCCCGACC TCCTCTGCGA CGCAGCCCAT CGCTACCGAG GCCAATGTTC CCTTCATCGG CCCCTTCACC GGCGCGGGCT TCTTGCGCGA CGCCTCTCAT GGCAACATCT ACAATGTGCG CGCCAGCTAT TTTGCGGAAA CCGAAGCCTG GATCGAATAT CTCGTCGATC AGCAAGGCAT GAAGTCGATC GCGATCCTCT ATCAGGACGA CGGCTTTGGC CGCGTGGGGC TGAACGGCGT CACCGCTGCG CTTGAAAAAC GCGGCATGAG CCTCGCGGCA GAAGGCACAT ATACCCGCAA CACCACCGCC GTCAAAAAGG CGCTGCTGGC GATCCGCAAG GCGAAGCCCG ATGCGGTGGT CATGGTCGGC GCCTATAAAC CGGTGGCCGA ATTCATCAAA CTCGCGCGCA AAATGAAGCT CGACTCGGAG TTCGTGAATA TCTCCTTTGT CGGCTCTGAC GCTCTGGCAC AGGAATTGGG CGAGCATGGC GAAGGTGTGA TCATCAGCCA GGTGGTGCCC TTCCCGTGGG ACATGTCGAT CCCGGTTGTC GCGCAATATA CCGAAGCCCT GAAGGCCGTG GATGCCGCCG CCAAGCCCGG CTTTGTGTCG CTTGAAGGCT ATATCGTCGG TCGTCTCGCC ATTGCCGGTC TCGAAGCCGC AGGCAAGGAG CTGACCCGTG ACTCCTATCT TGCCGCTCTG GCAGGACTCT CCACGGTCGA TCTCGGCGGT GTCAGCATGG TCTTTGGTGC GGACGACAAC CAGGGCATGG ATGACGTGTT CCTGACCCGT ATCACGGCAG ACGGCCAGTT CGAGCCCATC GTATCCGGCG GCGGCTCCTA A
|
Protein sequence | MFLKKTMLAA AAALAFPLIA SAEQGVTGDS VTFAQVAAFD GPAAALGTGM RLGITAAFEE ANAAGGVHGR MLKLDSMDDG YEPDRSAALV KTVIEGNGHI GLIGAVGTPT SSATQPIATE ANVPFIGPFT GAGFLRDASH GNIYNVRASY FAETEAWIEY LVDQQGMKSI AILYQDDGFG RVGLNGVTAA LEKRGMSLAA EGTYTRNTTA VKKALLAIRK AKPDAVVMVG AYKPVAEFIK LARKMKLDSE FVNISFVGSD ALAQELGEHG EGVIISQVVP FPWDMSIPVV AQYTEALKAV DAAAKPGFVS LEGYIVGRLA IAGLEAAGKE LTRDSYLAAL AGLSTVDLGG VSMVFGADDN QGMDDVFLTR ITADGQFEPI VSGGGS
|
| |