Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2997 |
Symbol | |
ID | 4078027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3164896 |
End bp | 3165993 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638008326 |
Product | hypothetical protein |
Protein accession | YP_614991 |
Protein GI | 99082837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.516476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTGGC TCGGTCTATG GCATAAAGAG GCGCAGTATT TCGACGCCGC TGGACTCACC CCGTCGGTAG AACGCAGCTG TCTCTTGTTG CGCGAAGCCG ATGCACTGTT GCCGCGGGGC GCGCTGGTTC TGGAGGTGAG CCTGCCGGAA TTGCGCAAAC CCGAACCCCT GGTGCTGTTT GAGCGTGGAG GCGACTGGCC TTTGCGATTT CAGCTCTCGG CGGTGCCGGG GGGCGGTATC AATCTGGTGC TGGAGCAATA TGGCGCGGTG TTTCACCAAA CGCTGAACCC GACCAAGCGG GGGCGTGCGG ATCAGGTGCG GCTGACCTAC AACTGGGACG CTCCGGCGTT TGAAGGCCAG CTTGCGCTCG AATGGCTTGA TGGAGATCGG GCCGAAATCG CGGATATTCA TGCCCCACGC CCCTGGCGGC TTGCGGATCT TGAGGCTTTG ATTGAAGGGG GGCCGCACTG CTTTATCGCC TCCGGCGTCG AATACATTGC ACTTTCTGAT CAGCCGGAGC CGGTGGGGCC GATGCCCGGC CTCTCGCCGT TTACACCGGT GGAAACGGAA ACGGGGGCGC GACCGATCAA GGACCTGCGT CGAGGTGACT TGCTGCGCTG CGCAAGTGGT GATTTGGTCC CCGTCTTGCA TAAAATCGAA CGTGAAGTGC CAGCGGTGGG CAGCTTTTGT CCCGTGCAGC TTCGGGCGCC ATATTTCGGC CTGACACAGG ATATTACCGT CGCGCCTTTC CAACGCATGG TGCTCACCGG GTCTGAGGTT GAATATCTCT TTGGATGTGA GGCTGTGCTT GCGCCGGCAG AAATGCTTGC GGCCACGCGC ACCGCCCGTC GGGTCTTGCC CGCGGGGCCG ATCACGACCT ATGCGCAGGT GATCCTGCCC GGCCATGAGG TGCCAGTTGT GGCGGGGCTC GGGGTCGAGA GCCTTTTTCT GGGGCGCATT CGGCGCGACC GTTCCGCACT TGGGGCCAGC CTCTTTGCAG GGCTAGATCG CAACTCTCTG CCAGAACACG CACAGCCGCG CTACCCCGTG GTGCGCGCGT TTGATGCCGC AATCCTTGCA GAACATCGCA CCGCCTGA
|
Protein sequence | MSWLGLWHKE AQYFDAAGLT PSVERSCLLL READALLPRG ALVLEVSLPE LRKPEPLVLF ERGGDWPLRF QLSAVPGGGI NLVLEQYGAV FHQTLNPTKR GRADQVRLTY NWDAPAFEGQ LALEWLDGDR AEIADIHAPR PWRLADLEAL IEGGPHCFIA SGVEYIALSD QPEPVGPMPG LSPFTPVETE TGARPIKDLR RGDLLRCASG DLVPVLHKIE REVPAVGSFC PVQLRAPYFG LTQDITVAPF QRMVLTGSEV EYLFGCEAVL APAEMLAATR TARRVLPAGP ITTYAQVILP GHEVPVVAGL GVESLFLGRI RRDRSALGAS LFAGLDRNSL PEHAQPRYPV VRAFDAAILA EHRTA
|
| |