Gene Information |
Locus tag | TM1040_1467 |
Symbol | |
ID | 4077764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1567246 |
End bp | 1569153 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638006778 |
Product | hypothetical protein |
Protein accession | YP_613462 |
Protein GI | 99081308 |
COG category | [S] Function unknown |
COG ID | [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
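The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1567246
end_bp = 1569153
gene_length_bp = 1908
protein_length_aa = 635

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
predicted_aa = codons - 1

print(span, span % 3, predicted_aa)  # prints: 1908 0 635
```

Under these assumptions the span matches the stated gene length, is a multiple of three, and predicts the stated protein length.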
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGATGC GGGCCAGACG CAAGCAGACC TTTGCGAATT CCCTGAGGGT GGTGACGCCC CTGCTCATTA TGGCGGGCTG TTTGTTCGCC CTGACCCGAC AAGCGGATCT GCCGCATTTC CACGACCTTC TGGGCCTGCT CACGCAGGTG CCCGCGCCGC ATTGGATCGG CGCCCTCGGG GCAACTGTGC TCAGTTTCTG GGCTCTTGGC CGCTATGACG CGGTCGCACA CCGGCATTTG CGCAGCGGAA TCGATGACCG GACCGCACGG CGCGCGGGCA TGGCCGCTAT CGCCTTTTCG CAGGCCGTTG GATTTGGCTT GTTTTCGGGG TCCTTTGCAC GTTGGCGCCT GCTGCCGCAA CTGAATCCAT TGCTTTCGGC GCAGCTGACC GGGTTTGTGG GCATCACCTT CATGACAGCC CTCGCTGTCA TTTGCGGGAT CTTTCTGATC CTTGTGGGAC CTTCATGGGG GATGCGCCTT GTGGGCGGGG GCATCCTGTT TGCCGCGATC GCCGCTGTTG GCCTGTGCTT CTTGCACCCA GAGTGGCGCA TCCGTGGTTT GCGGCTCCGG TTCCCTTCTG TGCAGGCCAT CCTTGCGCTG GCGCTTTGGA CCATGATGGA TGTGACCTTT GCCGGGGTCG CCCTCTGGCT GTTGCTGCCG GTTGGACATG GCATCGGCCT TGATGTCTTG CTGACCGCCT ATTTTCTGGC CCTTGGACTG GCGATCATCT CCTCCTCTCC GGGCGGAGCC GGGCCGCTGG AACTTGCAAT GCTCACGCTT CTGCCCGGTG CCGATCCCGC GACGCTGGTG GCAGGACTTC TCGCCTTTCG GGCAGTTTAT TATGCGCTGC CCGCGATGCT TGCGGGTGCT GTGTTGCTCT GGCCACGGCT GCTGCGCCAC GGAAAGGCAA TGCCGGACCC CTGGGAGACT GGCGATCTGG GCTGCGATCT GCGGCCTGCT GCCAGCCAGC CCTTCGTCCG CCCGCAGGCC GAAACGGCCG TGCTCTTGCA AAACGGCGGC CATGTGATGG CCTTTGGGCT CAATCAAGTG GCGCTCCTTG ATAGCCCGCA GCTCTCTGCG GTGCTATTTG ATCCAATCAG CGGCCGACAA GACGAGATCC CCGCCGCCCT GCGCGCCCAT GCACTCTCGC GCAATGCGGC AGCTTGCTTT TACAAATGCA GTGCCCGCAC CGCGCTGGCA GCCCGTCAGG AGGGTTGGAA GATCCTGAGA GTGGCTCAGG ACGCCATCCT TGCGCCGGAG ACCTTCACCG TCGAGGGCTC CAAGCATCGC CAACTGCGTC GCAAACTGCG CCACGCCGAG AAGGCCGGAC TGCAGGTGGA GCCCGCCTGG GGGACGCTGC CCTTGACCCA GATGGCCCAG GTTGATGCCG CATGGACGCG CCAGCACGGC GGTGCCCGTG GCACGACGAT GGGCCAGTTC GAGCCGGGAT ATGTGGCGAT CCAGATGACC TGTCTTGCCT GGCTTGAGGA CCGCCTTGTC GGCTTCATGA CCTTTCACCG GGCGGCGGAT GAATGGTGCC TTGATCTGGT GCGCCAATTG CCCGGGGCGC CCGATGGCAC CGCCCATGCT ATGATTTGCA CCGCGGTCGA GGCCGCGCGC GATGCGGGCG TGCGCCGCCT GTCGCTTGCG TCGGTTCCCG ATCACCGCTT CAGCGCGCGC TTTGATGGCG GCCTGCGTCA GTTCAAGTCC TGCTTTGCCC CCACCTGGGA GGCCCGATAC ATGGCGGCGC CAAGCTGGGC TCAGATGGGG CTCGCGATTG CCGAAATGAC CCGGCTGGTG 
CATCGTCCGG CGCGTCCGGA GGCTGCAATG GCGCAGATCG ACATGCTGCA GGACCCTATT CCTGATGATA CTGTCGAAAA TGCAGTTGCG GCAAAACGGA CCGCGTGA
|
Protein sequence | MPMRARRKQT FANSLRVVTP LLIMAGCLFA LTRQADLPHF HDLLGLLTQV PAPHWIGALG ATVLSFWALG RYDAVAHRHL RSGIDDRTAR RAGMAAIAFS QAVGFGLFSG SFARWRLLPQ LNPLLSAQLT GFVGITFMTA LAVICGIFLI LVGPSWGMRL VGGGILFAAI AAVGLCFLHP EWRIRGLRLR FPSVQAILAL ALWTMMDVTF AGVALWLLLP VGHGIGLDVL LTAYFLALGL AIISSSPGGA GPLELAMLTL LPGADPATLV AGLLAFRAVY YALPAMLAGA VLLWPRLLRH GKAMPDPWET GDLGCDLRPA ASQPFVRPQA ETAVLLQNGG HVMAFGLNQV ALLDSPQLSA VLFDPISGRQ DEIPAALRAH ALSRNAAACF YKCSARTALA ARQEGWKILR VAQDAILAPE TFTVEGSKHR QLRRKLRHAE KAGLQVEPAW GTLPLTQMAQ VDAAWTRQHG GARGTTMGQF EPGYVAIQMT CLAWLEDRLV GFMTFHRAAD EWCLDLVRQL PGAPDGTAHA MICTAVEAAR DAGVRRLSLA SVPDHRFSAR FDGGLRQFKS CFAPTWEARY MAAPSWAQMG LAIAEMTRLV HRPARPEAAM AQIDMLQDPI PDDTVENAVA AKRTA
|
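The gene sequence above is displayed in space-separated blocks of ten bases, so it has to be cleaned before it can be compared with fields such as GC content. The following is a minimal sketch using only the Python standard library; the helper names `clean_sequence`, `gc_percent`, and `looks_like_complete_cds` are hypothetical, and the ATG-only start check is a simplification, since translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Stop codons in translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start, stop codon at the end.

    Simplification: only ATG is accepted as a start codon here.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy fragment for illustration only; it is not the full gene sequence.
toy = clean_sequence("atgccg tga\n")
print(toy, looks_like_complete_cds(toy))  # prints: ATGCCGTGA True
```

To apply it to this record, paste the full Gene sequence field into `clean_sequence` and compare `gc_percent` on the result with the GC content field.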