Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3124 |
Symbol | |
ID | 4074995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 98105 |
End bp | 99355 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638004626 |
Product | hypothetical protein |
Protein accession | YP_611360 |
Protein GI | 99078102 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.610314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCAG AGGTCGCAAT TATTAATAGG TCGGGCATCG CTCTGGCCGC CGACAGCGCC GTAACGATCG GCCGCGACCG GGTATGGAAG AACTCCAACA AACTTTTTCA TCTAGCACCA TCAAACGACG TTGCAGTCAT GGTCTTCGGA AGCGGGGACT ACTGCGGCTT GCCTTGGGAA GTTGTGATTA AGGAATTTAG AAAAAGCCTC GGGAAAACAA CATACTCCAG GGTAGAAGAG TATGTTGGGC GCTTTCTGGG TTTTTTGGAC GACTTGGTTG TTCCTCCAAC ACCGCTGGCC GATTTGACCG GATGGTACAT AATCCTTAAC GCCATCAGCC AGACCCAAAA AGCTATGACT GCCAGTGGTT CGCTAAAACG TCGCCAACAG CTTATCGCTG CAATCTCCGA GAAAATCGAA GAAGCGGATC ACTATCCCCT CCTCTTCGAC GGATACTCTC GTGATCAGTA CCGCAAGAAA CACTCCCAAA AGATCAAAGA GTTTATGGCG GAAGAGCTTG GAATGCATGT CACCCAGACC ATGCACTCCA AGATGATAAC GCTTTGCTAC GAACGGTCGC GGAGAGCTTT CGAAACGAAG TTCGAGACGG GGGTTGTCTT CGCCGGATAT GGCGACTCGG AACTCCTGCC TGTCGTTATC GAAATGTGTG TCGATGGCGA ATTAGAAGGG AAGGTCAGAG CCTGGCAGGT TCGTGAAAAC AACATGAATG AAGGCGGAAC TTCTGGCGCC ATCCTCCCTT TCGCACAAGC CGATGTAGCC AATCTTTTTG TGGAAGGTAC GCTACCACAA TATCTGAGTT ATACTCGACA GACCCTTCTG CAGACCCTCG ACCTGAAAAC TGCAGAACTT GTTAAAGACT ATGTCCCAGA ACCAGATCGC GTTGTCGAAA TGGAACGGCA AAAGAAAGCC AACCGCGCTA TGGTTAAGCA GTTTTCAACC GACTTCAAAC AATATCGGCA CGACGAATCT GTCGCCAACC TTCTAAAAGT GGTAAACTCT CTACCCAAAG AGGAAATGGC GGCTATGGCT GAGGCTCTTG TGGAGATTAC CTCTTTGCGG AGAAAGATGG ATTCATCACT TGAGACTGTA GGTGGCCCTG TCGACGTTGC GATTATCTCA AAGTCGGACG GGTTTGTCTG GACAAAGCGA AAGCACTACT TTGATGTTGA ATTCAACAGA GATTTCATGG AAAGGCGCAA CCAAAGGTAT CAGGGGAACC AAGATGCGTA G
|
Protein sequence | MTAEVAIINR SGIALAADSA VTIGRDRVWK NSNKLFHLAP SNDVAVMVFG SGDYCGLPWE VVIKEFRKSL GKTTYSRVEE YVGRFLGFLD DLVVPPTPLA DLTGWYIILN AISQTQKAMT ASGSLKRRQQ LIAAISEKIE EADHYPLLFD GYSRDQYRKK HSQKIKEFMA EELGMHVTQT MHSKMITLCY ERSRRAFETK FETGVVFAGY GDSELLPVVI EMCVDGELEG KVRAWQVREN NMNEGGTSGA ILPFAQADVA NLFVEGTLPQ YLSYTRQTLL QTLDLKTAEL VKDYVPEPDR VVEMERQKKA NRAMVKQFST DFKQYRHDES VANLLKVVNS LPKEEMAAMA EALVEITSLR RKMDSSLETV GGPVDVAIIS KSDGFVWTKR KHYFDVEFNR DFMERRNQRY QGNQDA
|
| |