Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1272 |
Symbol | |
ID | 4077432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1370689 |
End bp | 1371888 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638006580 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_613267 |
Protein GI | 99081113 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.570271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TGCTGACCGC GACGGCAGCC ATCGCATTGA GCGCTGGCAC CGCGTTTGCA GACGGCCACG GCCACCCCGA CGAAGTAAAG CTGGGTGTCC TGTTTGGCTT CACCGGCCCG ATCGAATCCC TGGCGCCGAC CATGGCCTCT GGTGCTGAAC TCGCGATGTC CGAAGTCACC GAGTCCGGCA AACTCTTTGG CGGTGCAAAA GTGACCCCGA TGCGCGCGGA CACCGGCTGT ATCGACAATG GTCTTGCGAC TGCGAACGCA GAAAAGCTGA TCGCAGATGG CGCCAACGGC ATCGTGGGTG GTGACTGTTC CGGCGTGACC GGCGCGATCC TGCAGAACGT CGCGATCCCG AACGGCATGG TGATGATTTC TCCCTCCGCA AGCTCGCCGG GTCTGACCTC GATGGAAGAC AACGGCCTGT TCTTCCGGAC CACCCCGTCT GACGCACGTC AGGGCGAGAT CATGGCGTCG ATCCTTGCAG ATCGTGGCGT CGACAGCATC GCCATCACCT ATACCAACAA CGATTACGGC AAGGGTCTGT CGGATTCGAT CAAATCCGCA TTCGAGGCCG CAGGCGGTGA AGTCACCATC GTGACCGCGC ATGAAGACGG CAAGGGTGAC TACTCTGCCG AGGTTGCGGC GCTGGCATCA GCCGGTGGCG ATATTCTGGT TGTTGCGGGC TATCTCGACC AGGGTGGTCT GGGCATCATC CAGGGCGCGC TCGACACCGG TGCGTTCGAC ACCTTTGGTC TGCCGGACGG GATGATCGGC GATTCGCTGC CCAACAACGT GGGCCCGGAC CTCAATGGCT CCTTCGGGCA GATCGCCGGC TCTGACAGTG AAGGTGCCGA GATGTTCGCT GCCAAAGCCT CCGAGCTTGG CTTTGACGGT ACTTCTGCCT ATTCGCCGGA ATCCTATGAT GCGGCAGCGC TTTTCCTGCT CGCGATGCAG GCATCGGGCT CTGTTGATCC CAAGGATTAC GTCGCCAAGA TCACCGAAGT CGCCAATGCT CCGGGTGAGA AAATCAACCC CGGTGAGCTC GGCAAAGCGC TCGAAATTCT CGCCAATGGC GGTGAGATCG ACTATGAGGG CGCAACCGGC GTCAACCTGA TCGGCCCCGG CGAGAGCGCA GGCTCTTTCC GTGAGATCGA AGTTCAGGAC GGCAAGAACG TGACCGTGAA ATTCCGCTAA
|
Protein sequence | MKKLLTATAA IALSAGTAFA DGHGHPDEVK LGVLFGFTGP IESLAPTMAS GAELAMSEVT ESGKLFGGAK VTPMRADTGC IDNGLATANA EKLIADGANG IVGGDCSGVT GAILQNVAIP NGMVMISPSA SSPGLTSMED NGLFFRTTPS DARQGEIMAS ILADRGVDSI AITYTNNDYG KGLSDSIKSA FEAAGGEVTI VTAHEDGKGD YSAEVAALAS AGGDILVVAG YLDQGGLGII QGALDTGAFD TFGLPDGMIG DSLPNNVGPD LNGSFGQIAG SDSEGAEMFA AKASELGFDG TSAYSPESYD AAALFLLAMQ ASGSVDPKDY VAKITEVANA PGEKINPGEL GKALEILANG GEIDYEGATG VNLIGPGESA GSFREIEVQD GKNVTVKFR
|
| |