Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0900 |
Symbol | |
ID | 8567538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 1016107 |
End bp | 1017966 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | CBS domain containing protein |
Protein accession | YP_003290182 |
Protein GI | 268316463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.32031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACGA CCTCGAATGA GGCCCATGCC CTCTCGCCGT TTGCTCTGCA ACTGCTGCGA GATCTTGAAG TGCTGGCGCG GATGCTGAAG GCCGGACTGA TCGAACGGGG CGTCTGCCGG ATCGGGGCCG AGCAGGAGCT GTTTCTGGTG GAGCCCTGCG GCAATCCGGC GCCGGTCGCC GAGCAGGTGC TGAAGCGCAA CCGGGATCCG CATCTGGTAC CCGAACTGAC GCGCTTCAAC CTGGAGATCA ACCTGGAGCC GGGGCTGCTG GAAGGCGACG GCCTGCGCCG CCTGGAACGA CAGCTCCTGG AGCGGCTGGC CCATGCGCGT AAGCTGACGC GCGACCTGGG CGACGACATT GCCCTGTGCG GCATCCTGCC CACCATTCAT CTCTCGGACC TGACGCTCGA CAACATGACC CCCCGGCCGC GCTACGCCCG GCTCAATGAA GCCATCACGC GCCTGCGCGG TGGTCCGGTT CAGCTTCAGA TCACGGGCAT CGACGAGTTG TTCGTGCAAC TGGACTCCAT CATGGTCGAG GGCTGTAACG CCAGCTTCCA GGTGCACCTG CAGGTCGATC CGGACGCCTT TCCTCGCTGC TACAATGCCG CTCAGCTGGC CGCCGCACCC GTGCTGGCCG CCGGGGTCAA CGCCCCGGTG CTTTTCGGCA AGCGGCTCTG GCACGAGACG CGCATCGCGG TCTTCCGGCA GGCCGTCGAC ACGCGGGGCG GCAATCTGTA CCTGCGCGAG ATGAGCCCGC GTGTGCATTT CGGCACGGAC TGGGTGCGGG AGTCGGTGCT GGAAATCTAC CGGGAAGACC TGGCCCGCTT CCGGGTGTTG CTGACGCCGG AGTCGCCCCC GGAAGACGCC TTGAACGTGC TGGAACAGGG CGGCATTCCC CGGCTGGAGG CGCTGCAGCT TTTCAACGGA ACGGTCTATC GCTGGAATCG GGCCTGCTAC GGCGTGTCGG AGGGCCGTCC GCACCTGCGC ATCGAAAACC GCGTGCTTCC TTCCGGTCCC ACGCCCGCCG ACGAGGTGGC CAATGCGGCC TTCTGGGCGG GGCTCGTGCT GGAGCTGGCA CACCGCGAGC CGGAGCTGCA CCGGCGCATG GATTTCGACG AGGCCCGGGG CAACTTCATC GCTGCGGCCC GGCTCGGGCT GGGCGCGCAT CTGACCTGGC TGGACGGCGC GCGCTGGCCG GCCCAGCGGC TCATCCTGGA GGAACTGTTG CCGATGGCGC GGGCCGGACT GCAACGCGCG GGGGTCGATC CGGCAGACAT CGCCCATTAC CTGGGGCTCA TCCGCGAGCG GGTGGAAAGC CGCCAGACCG GCGCCGAATG GCAGCTTCGT TCGCTGGCGG CGATGAAAGG GCAGGGGAGC CGCGCCGAAC GCCTGGAGGC GCTGGTGCTC ACCATGCAGG CCTATCAGCA GGAGAATCGG CCGGTGCATC GCTGGGAGCC GGCGCGGCTG GCCGCGGTGC GGCGGCGCGC AGCGCAGGGC AACCGCCGCG TGGAGCACGT CATGACCACC GATCTGTTCA CCGTGCAGGA AGACGAGCCG CTGCAGTTCG TGGCGGCGCT GATGGACTGG AAGCAGCTGG ACGCCGTGCC CGTCGAGGAC GTCCACCACC GACCCGTCGG TCTGGTGCAC CGTGAGGCGG TGCAGCAGGC GCTTCAGGAA AAAGATCCGA CGACGGCCGT TCGGCTCGTG ATGAATCCAC GACCGATCGT GGTCACACCC GAAACGCCAC TGACCGAGGC GATGCGGTTG CTGGAGGCGC GGAAGGCCGC GGCGCTACTG GTCGTGCACC GGGAGCAGCT CGTCGGCATG CTCACCCGCG CCGAGCTTCC CGCTTTCTGA
|
Protein sequence | MATTSNEAHA LSPFALQLLR DLEVLARMLK AGLIERGVCR IGAEQELFLV EPCGNPAPVA EQVLKRNRDP HLVPELTRFN LEINLEPGLL EGDGLRRLER QLLERLAHAR KLTRDLGDDI ALCGILPTIH LSDLTLDNMT PRPRYARLNE AITRLRGGPV QLQITGIDEL FVQLDSIMVE GCNASFQVHL QVDPDAFPRC YNAAQLAAAP VLAAGVNAPV LFGKRLWHET RIAVFRQAVD TRGGNLYLRE MSPRVHFGTD WVRESVLEIY REDLARFRVL LTPESPPEDA LNVLEQGGIP RLEALQLFNG TVYRWNRACY GVSEGRPHLR IENRVLPSGP TPADEVANAA FWAGLVLELA HREPELHRRM DFDEARGNFI AAARLGLGAH LTWLDGARWP AQRLILEELL PMARAGLQRA GVDPADIAHY LGLIRERVES RQTGAEWQLR SLAAMKGQGS RAERLEALVL TMQAYQQENR PVHRWEPARL AAVRRRAAQG NRRVEHVMTT DLFTVQEDEP LQFVAALMDW KQLDAVPVED VHHRPVGLVH REAVQQALQE KDPTTAVRLV MNPRPIVVTP ETPLTEAMRL LEARKAAALL VVHREQLVGM LTRAELPAF
|
| |