Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_0031 |
Symbol | |
ID | 8566655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 28915 |
End bp | 31308 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_003289328 |
Protein GI | 268315609 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCGC TTCGCACCGC ACCGCTTTTC TGGACGGCGC TCGGCCTGGT CCTCGGGATT CTGGTGGCCG ACGCCCGGCC GATACCGGCC GGCTGGTGGC TGCTCGGCGC CGCTTGCACG GGGACGCTGG CGCTTGTTGC CCTGGTGGTG GATCAAAAAC GCCGGCGCGC TATCCCGTCG TGGCCGATCC CGCTGCTGCT GTGCGTGCTG TTTGTGGGGG CCGCCCGCCA CCGGCTTCAG ATGGCACCAC CGCCCCCTTT CCCGCTCGAC ACGACAGCGG TCGTGCGGGC CGAAGTTCAG AAGACGCCCG AAGCGACCGC GTACGGCCTG CGCTTTCCGG CCCGCACGCG CCTGCTGCTG ATCGGCCACG ACACGCTGGC CGACCGGGTG CTGCTGGACG TGCGCCTGGT GGCCGACAGC CTGCCCGCGC TTGCGTGCGG TCAGCGCGTG CTGCTGGGCG GACGACTGGA GCCGCTTCGC GTCCCCCGCA ATCCCGGGCT TCCGGACCGC ACCGCACAGT GGCGGCGACA GGGCGTAGTC GCGCGCCTGG TGGTGGACGA TCCCCGGCTC GTGCAGGTGC TCGACGCCCG CCGCTGCTTC ACAGACCGGC TGGTTTCGCT CCGGGATTCG ATCACCTCCG TGCTGCAGCG ATACATGCCC CGGCCTGAAG CCCGGCACGT GGTGCAGGCC CTGCTGCTGG GCGATCGATC GGGGCTTTCC CCGGACGTTC GCAACCGGCT CGGCCGCGCC GGACTGGCCC ACCTGCTGGC CATTTCGGGG CTGCACGTGC TGCTGGTGGG GCTGGTGCTC TACGGCCTGC TCCGTCCGCT GCTGCTACGC CTGGGGCTGA GCTGGTGGAG CATGGAATGG ACGCGCACCG TGCTGACGCT ACTCGTGCTG AGCGGCTACG TGCTGCTGGC CGGCGCTCCG GCCTCGGCCG TCCGCGCCCT GGTGATGACC GCGCTGTTTC TGGGAGCGAC GCTGTTTCAG ACGCCGCCGC ATCCGCTGAA TGCGCTGGGC GCTGCGGCCG TCGTGCTGCT GCTGGCCGAT CCTGCCCAGC TCTTCGAGCC GGGCTTTCAA CTTTCGTTTG CCGCCGTGAT CGGCCTGCTG CTGGGCTGGC TGCCCCTGCA GCAACGCCTT CCGGCCGTCA TTTTTCGCCG GCCCGCATTA CGCTATCTGA CTGGTACGCT GCTGGTGACG CTGACGGCCA CGCTGGCCAC GGCGCCCTTC GTGCTCTACC ATTTCGGCTA CGTGTCACTG GCCGGGCTGC CGCTGAACCT TCTGGGTATC CCGCTGGCCG CGGGCGCCTT GGCGGGCGGC CTGCTCTCCG TGTTGAGCGC GCCGTTCAGT CCGGCGCTGG CCGAACTTTT CGGCCAGGCC GCCACGCTGT GTGCTTACCT GCTCCTGCAA CTCGGCGAAC TGGGCCTTCG CCTGCTTCGC CCTCTGGTGC TGCACGTGCC CGAACCGCCC TGGTGGATGC TGATAGCACC GCCGGTACTG TTGGGACTGT GGAGCAGATC TCCCGGCATC CGTCGCCGGT CCGGACTGGT GCTGCTCGGC GTGCTGAGCC TCGGTCTCTG GACGACACCA CCCGCCGCGC CCCATCTGGA CGTGCTGTTT TTCGACGTCG GCCATGGTGA TGCCGTGCTG ATCCGCGCAC CCGGCGGCCG GCACCTGCTG ATCGATACGG GCGGCCGCTA TGGCGGACGC GTGGCGGCCG AATGGAGTCT ACTGCCTTTC TTCCGACGAT ACGGGATCGA TCGGCTCAAT GCCGTGCTGA TCACACACCC CGACGCCGAT CACGCCGGCG GTCTCCCGCT ACTGCTTCGC CGACTTGAAG TCGGCCGTGT GCTCGACAGC GGCACAGTTG ACTCTTCGGC CCTTTCGCTG GAAATCGCCC ATCTGATCGA CAGCCTCCGG CTTCCACGCC AATCGCTGCA GGCCGGCGAT ACCGTTCGTC TGGATCCCGC ACTGGTGCTC CAGGTACTGG CGCCCGCGCC GAATGCCAGG GACGTACCCG ACAACGAACG CTCCGTGGTG CTTCGGATGG TTTTCGGCCG GACGCGCTGG CTGTTTCTGG GCGATGCGGA ACGCGAACTG GAACGCCAGC TCACGCAGGC CTACGGCGAC CTGCTGGCAA GCGATGTGGT CAAGGTGGCC CACCACGGCT CCCGCACCAG CTCCACCCCC GAACTGGTCC AGCAAGTGAT CCCGACACGC GCACACCCCG TCCGGGCCGT GATCAGCAGC GGCTGGCGGG GCGTCAGCGA CTCGGTTCGC GTTCGGTGGG AGCGTCAGGG CGCCCGGCTG TGGATCACGG CCGATTCCGG CGCGCTCTGG CTTCGCAGCG ATGGCTGGCG GATCTGGCCG GTCGCCTGGC GCAGGGCTCA GTGA
|
Protein sequence | MPSLRTAPLF WTALGLVLGI LVADARPIPA GWWLLGAACT GTLALVALVV DQKRRRAIPS WPIPLLLCVL FVGAARHRLQ MAPPPPFPLD TTAVVRAEVQ KTPEATAYGL RFPARTRLLL IGHDTLADRV LLDVRLVADS LPALACGQRV LLGGRLEPLR VPRNPGLPDR TAQWRRQGVV ARLVVDDPRL VQVLDARRCF TDRLVSLRDS ITSVLQRYMP RPEARHVVQA LLLGDRSGLS PDVRNRLGRA GLAHLLAISG LHVLLVGLVL YGLLRPLLLR LGLSWWSMEW TRTVLTLLVL SGYVLLAGAP ASAVRALVMT ALFLGATLFQ TPPHPLNALG AAAVVLLLAD PAQLFEPGFQ LSFAAVIGLL LGWLPLQQRL PAVIFRRPAL RYLTGTLLVT LTATLATAPF VLYHFGYVSL AGLPLNLLGI PLAAGALAGG LLSVLSAPFS PALAELFGQA ATLCAYLLLQ LGELGLRLLR PLVLHVPEPP WWMLIAPPVL LGLWSRSPGI RRRSGLVLLG VLSLGLWTTP PAAPHLDVLF FDVGHGDAVL IRAPGGRHLL IDTGGRYGGR VAAEWSLLPF FRRYGIDRLN AVLITHPDAD HAGGLPLLLR RLEVGRVLDS GTVDSSALSL EIAHLIDSLR LPRQSLQAGD TVRLDPALVL QVLAPAPNAR DVPDNERSVV LRMVFGRTRW LFLGDAEREL ERQLTQAYGD LLASDVVKVA HHGSRTSSTP ELVQQVIPTR AHPVRAVISS GWRGVSDSVR VRWERQGARL WITADSGALW LRSDGWRIWP VAWRRAQ
|
| |