Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2971 |
Symbol | |
ID | 4078001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 3136295 |
End bp | 3138247 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 638008300 |
Product | hypothetical protein |
Protein accession | YP_614965 |
Protein GI | 99082811 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.123475 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTTA CACCGCTTAT TCCATCATCT GGGGTTGCGG GCTGGAATTT TCTGCAATCG ACCTATGACC GGCAATACGA TGCGTTTGTC CAGTCCGGGA AGTTGAAGAA TGACAGTGAG TACTTCGCCG AGAATATCGG CGAAGTCACG TCCAGCGAAG ACTTGCTTAA TGACCGCCGC CTGCTTCAGG TGGCGGTTAA GGCGTTTGGT CTGGAAGAGG AAATCAACTA CCGGGCCTTG CTGCAGCGTG CGCTCAACGA AGGCACCTCT GCCAGCGATG CGCTTGCCAA CACAATGAAT GACGAGCGCT ACGTCGAATT CTCCAACGCC TTTGGATTCG GCCCGGGTCA ATCACCGATG ACCAGCGACA GCAAAGCAAT GCAAGCGGTG ATCGACAAGT TCCAGTCTGC CTCCTTCGAG GAAGCGGTGG GAGAAGTCGA TGAAACGATG CGCACCGCGC TCTTTGCCAA ACGGGCCATG ATCGAGGTCT TTGGCGAGCC GGACGAGGAT GACGTATCGC AGCTGAGCGT GAGAGAACGC GCCCTTCGCG AGTTTGACCT CGCCATGAAG GAGATCAACG GCAAAGACGA TGATGTCCCT GGTGTGACCT CTGTCGAGGA CCAGTGGGAA GATATTATCG AGCGCGACAC GCTTCGAGAA TTCTTTGATA CCACGCTCAG AATCTCTGCA GGTGCCGCTG GACTTGAAGA CGACGAACGC ATCCAGCTTT ATCGAGAACG TGCGCAGATT ATCTTTGGAA CCGACGATCC AACGGTCTTC TTCTCGGCCG AGAACAAAGA CACAATCATT TCCGCCTTTA AAACCCGTGC AACCGTGAAC GGGGATGATG CAGCTGAAAC CGCAAAGACT GCGGAACTGT CAGAACACAT CCTGGATCAG ATGATCTCTC GCGATGCCGC GCTGAATGAA GAATGGGATT TCATCTCCCG GCAGGAACCC CTGGCGGAGT TTATGAAAAC CGCTCTGGAA TTGCCTGACG ATATCGCGAC ACGAGAGACC AGCGAAGCCA TGCGGATCTA TCGTGAAAAA GCCATTGAAG CCTTTGGCAC AGATGACCCC AATGTCTTTG CTAGTGCAGC AAACTTGGAT GCAACACTCG AAGTTTATCG GGAAAATGCC ACGAACGCTG GTCTCTCCAG TAGCGAAATC TCTTCCAATC TGCGCACGGC TGAGACCATC CTAAAATTCT CCTACAATCA GGGGGATGCG ATCGATGGTG GTGATGCGGA TGCCGCGGCA ACCGCGGAAA TAGATGCCGG GTGGTACAGT GTCATGGGTC AAACCGGCGT CCCGCGCTTC TTGAACACGG CCCTTGATGT GTCCTCTGCG CTTGCGCCGG GTGAAATCTT TTCGGACCTG ACGATCGACG AGCAGCTAAC AATCTACAAA GATAAAGCGG TTGAACTGTT TGGGACCGCT GACCTCAAAG AGCTGACCGG CCCGTATCAA ATCGGCGCAG TGAACGATGC CTATCGCACC AACGCGGAAG CTGCGGGTGA GAGCGAATTC TTCATTACCT ATTATTCCGA TATCGCCGAG CGAGAACTAA ACACCTTGTT CCAGAGCGAC GATACCGATG CTGAAAAAGC GTTTGAAGAA GCTCTCGAAG AACTGCGCAC GATGAATGAT GAAGACAATG GTCCAAGCGC CAATTCGCAG TGGTTCACGA TCATGGGCCA GCAGGCGATG ACCGACTTCA TGCAAGTCGC GTTGGGCCTG CCCAAAGAGG TTGGTCAAAT GGATATTGAC CAGGCTGTCG AGGTCTACAA GCGCAAGGCA CAACAAGTTC TGGGGACCGA CAAACCCTCG GAATTCATCT CTGGCGACAA GATGGATGAG CTGGTCAACA TGTATCTCAC CCGCTCGCAG ATGAACAATC TCAGCAGCGG ATATAGCTCC GGCAGCGCCG CCCTCATGAT GCTGCGCGGC TAA
|
Protein sequence | MTFTPLIPSS GVAGWNFLQS TYDRQYDAFV QSGKLKNDSE YFAENIGEVT SSEDLLNDRR LLQVAVKAFG LEEEINYRAL LQRALNEGTS ASDALANTMN DERYVEFSNA FGFGPGQSPM TSDSKAMQAV IDKFQSASFE EAVGEVDETM RTALFAKRAM IEVFGEPDED DVSQLSVRER ALREFDLAMK EINGKDDDVP GVTSVEDQWE DIIERDTLRE FFDTTLRISA GAAGLEDDER IQLYRERAQI IFGTDDPTVF FSAENKDTII SAFKTRATVN GDDAAETAKT AELSEHILDQ MISRDAALNE EWDFISRQEP LAEFMKTALE LPDDIATRET SEAMRIYREK AIEAFGTDDP NVFASAANLD ATLEVYRENA TNAGLSSSEI SSNLRTAETI LKFSYNQGDA IDGGDADAAA TAEIDAGWYS VMGQTGVPRF LNTALDVSSA LAPGEIFSDL TIDEQLTIYK DKAVELFGTA DLKELTGPYQ IGAVNDAYRT NAEAAGESEF FITYYSDIAE RELNTLFQSD DTDAEKAFEE ALEELRTMND EDNGPSANSQ WFTIMGQQAM TDFMQVALGL PKEVGQMDID QAVEVYKRKA QQVLGTDKPS EFISGDKMDE LVNMYLTRSQ MNNLSSGYSS GSAALMMLRG
|
| |