Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_1714 |
Symbol | |
ID | 8568366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 1988709 |
End bp | 1991552 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | polysaccharide export protein |
Protein accession | YP_003290988 |
Protein GI | 268317269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.397648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCAGA AACGTCAGCG TCGGTTACCG CTGGCCGGAG CAGTGCTCTG GCTGGTTCTG TGGCTTCTGC CTTCTGTCGT TCATGCTCAG GAGATCCCCC AGCCCGTCCA GGAAGAGATC CGGCGGCGGG GCATGACCGT CGAGGAAGCC CGGCGGGAAG CCGAACGCCT GGGCATCGAC CTGTCGAATC CCGAGCAGGC CGCCCAGCGC GCCCGGGAAT TGGGCATTCC CGAAAGCCAG ATTCAGGCCA TGCTGCGCGC CGCGCAGCAG GAGCAGGCGC AGCAGCTTCC TCGTATTCTT ACGCACGGGG TCTATCCGGT GAGCTTTCAG GACACGCTGG CGCTCGATAC GCTGCAGGCG CTGGTGGACT CACTGCGGCT GCGACGCGAC TCGCTTCGAC AGCGAAAGGC GAAAGCTCCT TCTGATTCGC TTCCCTACTT CGGCTACGAT GTTTTCGAGA ACATTCCGGA CGCCTTCAAA CCCAACCAGC TGGGACCGGT CGACGATCAG TACCTGGTGG GTCCGGGCGA TGAGTTGCGC CTGATGGTCT GGGGTGCCAC GGAGTTCGCC TACGACTTGA CTGTCGACCG CGAAGGCCGC ATCTTCGTGC CCAGCGTCGG GCAGTTCACG GTGGCCGGCA AGCGGCTCGA CGTCCTGCGC GAGGAACTCA AACGCTGGTT GGCCCGCAGC TACTCCGGCT TACTGGAGGA CCCGCCCACG GTCTTCATGG ACCTGACCGT CGCGCGTCTG CAACCGGTCT ATATCTATGC CCTGGGCGAA GTGAAGCAGC CCGGCGGCTA CGTGATCGCC AGTCAGTCGA CGGTTTTCCA GGCGCTCTAC GCCGTCGGAG GCCCGAAGAT CAGCGGCTCG CTCCGCGACG TGCGCGTAGT GCGTGGTGGA AGAGTGCTGG CCCATGTGGA CCTCTACGAC TACCTGCTAC GCGGCGAGGG ACGCGAGGAC GTGCGCCTGC AGAACAACGA CCAGCTCTTC GTGCCACCAC GCGGCAAGAC GGTGGCCATC CGCGGCCAGG TGCGCCGCCC GGCTATCTAC GAACTGAAAG AAAACGAGGG ACTGCGCGAG CTCATTCAGT TTGCGGCCGG ACTCAAACCC GAGGCCTTCA CCCGCTACGT CCGCATCGAA CGCATCATCC CCTTCGAGCA GCGACAGGAC CCCTCGGTGG TGCGCGAGGT GATCACCGTG CCGCTCGACG GGGTGCTGGA CGGCTCGCGC CAGGTGCCGC TCTACGACGG GGATCGCGTC GAGGTGCTCT CGGTGCTCGA CGTGAGCCGC AACGGCGTCT ACATCAGCGG GGCCGTCGTA CATCCAGGGC TCTACGAAAT CACCACGCAG GTGCGCACGA TTCGCGACCT GATCGAACGG GCCGGCGGGG TGACCAGCGA CGTCTACGAA GGCCGCGTCC AGCTCGTGCG CTTCAAACAG AATCCGGCCG AGCGACCGCC TTCGGTGCCA GTTACCGTTG GTGATCCAGA CGACTTGGCC CTCTTGGAAA AAATGGTCAC GTTGGACCTG TCGCGTATTC TGCTGGGTGA TCCTGAACAC AACCTGGCGC TTCAACCCGG TGATCGTATT CGTGTTTACT CAGAGCTGGA TATTAATGTA CCGCGTACTG TAACAATTGA GGGTAAGGTG CGCAAACCGG GGAGTTATGC GTTGCGCGAC AGCATGACGG TCTACGACCT GCTCTTCCTG GGCGGAGGCC TCTTCGACGA AGAATTTCGC AAGGAAGTCT ATCTGGAACG CGCCGACCTG ATTCGCAAGG CCGAACACGG TACGGAAGAG ATCATCATCC CGTTTAACCT GGCCGAAGCG CTGCGTAACG AAGGGGCCGG ACGGGCGCTG CTGCAACCGG GCGATCGCAT TCGCATCTAT CCGGTCGACG TGCAGGAAAT CCGGGATAAG TTCGTCACGA TCAGCGGCGC CGTCAAAAAT CCGGGACAGT ACCGCTGGCA GGAGAATATG ACGCTGGAAG ACCTGATCCT GCGAGCCGGC GGCTTTACGG AAGACGCGCT GCTGGACTGG GCCGAGGTGA CACGTCTGCC GAAAGGGGCC GATCCGGAGC AGTTCGAGCA GCTGGCCGTG CGCATCGAGG TGCCCATGGC CGAGGATATC GACGACGTGG AGGTCGTCTC GTTCGCGCTG GACGATACGG CCCGGGCGCT GCGCGGAGCA CGGACGTTCC GGTTGCAGCA CCGCGACCGC GTCTACATCC GCAGCAATCC GGCCTTCCGG CCGCAGCAGA CCGTGACTGT CTCGGGTGAG GTCTGGTATC CGGGCACCTA CACGATCCTG CGCGAAAACG AAACGCTGGC CGACGTGCTA CAGCGGGCCG GTGGCGTGCG GCCTACAGGC TACCTGAAGG GGGCCCGCCT GATCCGCGGC GGCCTGCCCG TGGTGATCGA CATGGAGCGG GCCCTTCGGC GGGATCCCCG TCACAACGTC ATTCTCCTGC CGGGCGACGA AATCCGTGTG CCGCCCAGGC CCGGCACCGT GGTGGTACGC GGCAATGTGC GGCGTCCGGG TCTGGTGAAG CATGTCCCCG GCCGACGCGT GGGCTACTAC CTCGAACGGG CCGGTGGCCT GGACGAAGAC TCGAAAGTCA TCCTGGTCAC ACAGGCCGAC GGCGGTACCT ACCCGGTCTA TCTGGGCCTG AAGGGCTGGT TCCAGCGCGA TCCGGTGGTG GACGAAGGCG CCATCATCGA AGTCGTGCGC AAACCGCCTG AAGAAAAGCG CCAGGTGACG TTCGACATCG GCAAAACGCT GACCGACATC GCCTCGATCG CCGCCAGCAC GCTTACCATC ATCGCACTGG CCCGACGACT TTAG
|
Protein sequence | MRQKRQRRLP LAGAVLWLVL WLLPSVVHAQ EIPQPVQEEI RRRGMTVEEA RREAERLGID LSNPEQAAQR ARELGIPESQ IQAMLRAAQQ EQAQQLPRIL THGVYPVSFQ DTLALDTLQA LVDSLRLRRD SLRQRKAKAP SDSLPYFGYD VFENIPDAFK PNQLGPVDDQ YLVGPGDELR LMVWGATEFA YDLTVDREGR IFVPSVGQFT VAGKRLDVLR EELKRWLARS YSGLLEDPPT VFMDLTVARL QPVYIYALGE VKQPGGYVIA SQSTVFQALY AVGGPKISGS LRDVRVVRGG RVLAHVDLYD YLLRGEGRED VRLQNNDQLF VPPRGKTVAI RGQVRRPAIY ELKENEGLRE LIQFAAGLKP EAFTRYVRIE RIIPFEQRQD PSVVREVITV PLDGVLDGSR QVPLYDGDRV EVLSVLDVSR NGVYISGAVV HPGLYEITTQ VRTIRDLIER AGGVTSDVYE GRVQLVRFKQ NPAERPPSVP VTVGDPDDLA LLEKMVTLDL SRILLGDPEH NLALQPGDRI RVYSELDINV PRTVTIEGKV RKPGSYALRD SMTVYDLLFL GGGLFDEEFR KEVYLERADL IRKAEHGTEE IIIPFNLAEA LRNEGAGRAL LQPGDRIRIY PVDVQEIRDK FVTISGAVKN PGQYRWQENM TLEDLILRAG GFTEDALLDW AEVTRLPKGA DPEQFEQLAV RIEVPMAEDI DDVEVVSFAL DDTARALRGA RTFRLQHRDR VYIRSNPAFR PQQTVTVSGE VWYPGTYTIL RENETLADVL QRAGGVRPTG YLKGARLIRG GLPVVIDMER ALRRDPRHNV ILLPGDEIRV PPRPGTVVVR GNVRRPGLVK HVPGRRVGYY LERAGGLDED SKVILVTQAD GGTYPVYLGL KGWFQRDPVV DEGAIIEVVR KPPEEKRQVT FDIGKTLTDI ASIAASTLTI IALARRL
|
| |