Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_2033 |
Symbol | |
ID | 8568690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2365448 |
End bp | 2367334 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | YP_003291302 |
Protein GI | 268317583 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAGA ATCGGCCCCA TTCCGAACCG TTGCCTTTCG TGGAGCCGTC AGCCGCGGGC GATGACGTCA ACGAAGCAGA ATTTTCGCTG ACCGATCTGA GCGATCCGGT CATTATCATG CTCCTGTTTC AGGAGCTGGT CCGTGAAGAG CAGGTGCGGA AGGCCTGGGA AAAGTGGCGT CAGCTCGACA GAGACCGGCG GCGGCCGCTC TGGCGCGTCC TTACCGAGAT CGACGGCATC AATCCCGAGG TCGTCTATGC GACGGCCGCC GAAGTGTACG GCTTCAAGAC GGCCCGTATC GATCGGGCTC AGGTGTTGCG TTTCCTCCGG GATCAGCGCC AGCGCTTCAC GCAGGAACAA TGGAGCTGGA TGCAGCGGGA GCGGGTGTTG CCGATCGGGC AGGAGGTGGA TGCCGAGCGC GACGTGATGC GCTGGCTGCT GGCCACGCAC GATCCGACCC GTCCCAACCT GCACCGTCAG ATCGCCCGGC TCGGGATCGA TCGCTTCGAG TTGCGCTATG CGCCGGCTTC GACGATTGAG CAGATCTTTC AGGAGGCGTT CCCCCGCCGC AATGAATACC TGGAGCGTGT GCAGCAGGAA GACGGGGCGG TCGATCTCGG AATGAGCTAT GAGGAAAATA CCGAGCTGAT CGACGAGGAG TCGCTGGAGG CTGAAATTAA CCGCTCCAAG CTGATCAACC TTTTCGAAGC GACGCTCGTC GAAGCCGTCC GCCAGGGCGC CTCCGACATC CACATCTTCC CGAACCACGA CCGCAAGGTC GAAATTCACT TCCGGATCGA CGGCGAACTG CACCGCTGGC ACCTGGAGGA CAAGGTGCAT CCCGAAGCGT TTCTGGCCGT GGTGAAGGAC CAGGCGGGTG GGGTGGACCG CTTCGAGCGG GAAAAAGCGC AGGACGGTTT CATCCAGCGC TGGATCGACG ACCACCTCAT CCGCTTCCGC GTCTCCGTGC TCCCGATCGC CACGGCCAGC TTCGATGTGC GCGCCGAGTC GATCGTCATC CGCGTGCTCG ACGACCGCAA GGTCATCAAA GACCTGCGGC TGCTGGGGCT TTCCGAGCGG GCGCTGGAGC GCTTCGAGTG GGCGATCCGC CAGCCCTACG GCATGGTGAT CGTCACCGGT CCCACCGGAA GCGGTAAAAG CACCACGCTC TACGCCGCCC TCCACCAGGT CGTCAGCCCG CGCAAAAACG TGCTGACCGT CGAAGATCCG GTCGAGTACA TCATCCCCGG CGTGCGCCAG ATCAAGCTCA GCCACAAGCT GGGGCTGGAG GACGCGCTGC GGGCCATCCT GCGACACGAC CCCGACATCG TGATGGTGGG CGAGATGCGC GACCGCCAGA CGGCCGAGCT GGCCATCAAG CTGGCCAACA CGGGGCACCT GACGTTTTCG ACGCTGCACA CGAACGACGC ACCCAGTGCG GTGAGCCGGC TCTACAAGAT GGGGATCGAG CCGTTTCTGA TCGCCTACGC GATCAACCTG GTCGTGGCCC AGCGCCTGAT CCGGAAGCTC TGTCCGAAGT GCCGGGTGCC GGACCCGGAT CCCGATCCGG TGCTGCTGAT GCGGCTCGGC TTTACGGAAG AACAGATCGC CCGCACCACG TTCTACAGAG CCGGCCAGAA TCCGCGCTGT CCGGTCTGCA AGGGGGTCGG CTACAAGGGG CGGCGGGCGA TCACCGAGAC GCTCTGGTTC TCCCGCGCCA TTCGCCACAT GATCGTGGCC GCCCGCGACG CGATCGACGA AGACGCGCTA CGCGCGCAGG CGATTAAAGA AGGCATGCAG ACGCTGCAGG AAGCCGCCCG CGAGGTCGTG CTGGCCGGCG AAACCACCAT CGAAGAAATG CTGGCCACCG TGGCCTTCGA GGGTTGA
|
Protein sequence | MSENRPHSEP LPFVEPSAAG DDVNEAEFSL TDLSDPVIIM LLFQELVREE QVRKAWEKWR QLDRDRRRPL WRVLTEIDGI NPEVVYATAA EVYGFKTARI DRAQVLRFLR DQRQRFTQEQ WSWMQRERVL PIGQEVDAER DVMRWLLATH DPTRPNLHRQ IARLGIDRFE LRYAPASTIE QIFQEAFPRR NEYLERVQQE DGAVDLGMSY EENTELIDEE SLEAEINRSK LINLFEATLV EAVRQGASDI HIFPNHDRKV EIHFRIDGEL HRWHLEDKVH PEAFLAVVKD QAGGVDRFER EKAQDGFIQR WIDDHLIRFR VSVLPIATAS FDVRAESIVI RVLDDRKVIK DLRLLGLSER ALERFEWAIR QPYGMVIVTG PTGSGKSTTL YAALHQVVSP RKNVLTVEDP VEYIIPGVRQ IKLSHKLGLE DALRAILRHD PDIVMVGEMR DRQTAELAIK LANTGHLTFS TLHTNDAPSA VSRLYKMGIE PFLIAYAINL VVAQRLIRKL CPKCRVPDPD PDPVLLMRLG FTEEQIARTT FYRAGQNPRC PVCKGVGYKG RRAITETLWF SRAIRHMIVA ARDAIDEDAL RAQAIKEGMQ TLQEAAREVV LAGETTIEEM LATVAFEG
|
| |