Gene Information |
Locus tag | TM1040_0124 |
Symbol | |
ID | 4078729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 131939 |
End bp | 132949 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005411 |
Product | extracellular solute-binding protein |
Protein accession | YP_612119 |
Protein GI | 99079965 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
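The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the function names `gene_length` and `protein_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(131939, 132949)
length_aa = protein_length(length_bp)
```

With the values from this record, `length_bp` matches the listed Gene Length (1011 bp) and `length_aa` matches the listed Protein Length (336 aa).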
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA AAGTCTTCAC TACCGTAATC GCCGCGAGCC TTGCTACGAC CGCCGTTGCT GACGGGGTTG TGAACCTGTA CTCGTCCCGC CATTACGACA CCGACGAACG CCTCTACACC GACTTTGAAG AGGCTACCGG TATCACCATC AACCGCATCG AAGGCAAAGC CGATGAGCTG GTCGCGCGCA TGCAGGCCGA AGGGGCCAAC TCTCCTGCGG ATGTTCTGAT CACCGTCGAC ACCTCCCGCC TTGAGCGCGC GAAGAACGCC GGTGTGCTTC AGTCCATCGA CAGCGACATT CTTGAAGAGC GCATCCCCGC CAACCTGCAA GATAGCGACA ACCAGTGGTT TGGTTTCTCT CAGCGTGCCC GCATCGTCTT CTATGACAAG ACTGACGTGG CCAACCCGCC CGCAGACTAC ATGGATCTTG CCAAGCCCGA ATACAAAGGC ATGGTCTGCC ACCGATCGTC TTCCAATGTC TACTCCCAGA CCCTGCTGTC GGCCATCATC GAGAACCACG GTGAAGAGGC GGCACGCGAT TGGGCAGAAG GCATCGTCGC AAACTTTGCC CGCGATCCGC AGGGTGGCGA TACCGACCAG CTACGCGGCC TGATCTCCGG CGAGTGCGAC GTGTCGATTG CAAACACCTA TTATTTTGCC CGTGCCCTGC GCAAAGACGT CAAAGGCCTC TCGGCTGAGA TCGAGAAGAT CGGCGTCGCC TTCCCGGCTC AGGACGCTGA AGGCGCCCAC ATGAACCTCT CCGGCGCCGG TGTTGCAGCA CATGCACCGA ACCGTGAGAA CGCCATCAAA TTCCTCGAGT ACCTGGCTTC CGATCAGGCG CAGGAATATT TCTCCGGCGG CAACGATGAA TTCCCGGCGG TCCCGGGCGT CAGCAAGTCG GAAAGCGTTG CACAGCTCGG CGAGTTCAAG GCCGACGACG TGGACCTCTC CAAGGTCGCC AAGAACGTGC CGACCGCACA GAAGATCTTT AACGAGGTTG GCTGGGAATA A
|
Protein sequence | MKIKVFTTVI AASLATTAVA DGVVNLYSSR HYDTDERLYT DFEEATGITI NRIEGKADEL VARMQAEGAN SPADVLITVD TSRLERAKNA GVLQSIDSDI LEERIPANLQ DSDNQWFGFS QRARIVFYDK TDVANPPADY MDLAKPEYKG MVCHRSSSNV YSQTLLSAII ENHGEEAARD WAEGIVANFA RDPQGGDTDQ LRGLISGECD VSIANTYYFA RALRKDVKGL SAEIEKIGVA FPAQDAEGAH MNLSGAGVAA HAPNRENAIK FLEYLASDQA QEYFSGGNDE FPAVPGVSKS ESVAQLGEFK ADDVDLSKVA KNVPTAQKIF NEVGWE
|
| |
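The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the gene sequence is assumed to be given in coding orientation (it starts with ATG even though the gene lies on the minus strand), and the codon table used is the standard one, which has the same codon-to-amino-acid assignments as translation table 11 (the two differ only in which start codons are permitted).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
prefix = "ATGAAAATCA AAGTCTTCAC T"
peptide = translate(prefix)
```

Here `peptide` is `MKIKVFT`, which agrees with the first seven residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked the same way.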