Gene Rmar_2444 details

Gene Information

Locus tag: Rmar_2444
Symbol:
ID: 8569110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 2839523
End bp: 2842528
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_003291710
Protein GI: 268317991
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTGCAC GGAAACGTCT GACACTGAAG CGCTGGCTGG CGCTGGGACT GTGGTTGCTG 
CTGGTGGGGG CGGCCCACGG GCAGGTGATC CGGGGAACGG TGACCGACCG GGACACGGGC
GATCCGCTTC CGGGTGCCAA TGTCGTGATC AAAGGCACCG TGCGGGGGGC CGCCACCGAC
GTGGACGGTC GGTTTGTCAT CCCCAACGTG GCGCCCGGCC AGTATACGCT GGTGATTTCG
TACCTCGGCT ATCACACCGA AGAGGTGCCG GTGACGGTCG CGGAGGGCGC TTCGGAGGTC
GAGGTCAACG TCCAGCTTGT CTGGGAAGGG ATTGTCGGGC AGGAGGTGGT CATCACGGCG
CAGGTGGCCG GACAGCTGGC GGCCATCAAC GAGCAGTTCT CGGACGTGGT GGTCAAGAAC
GTGGTCTCGC GTGACCGCAT TCTGGAGCTG CCCGACAACA ACGCGGCCGA GTCGATCGGA
CGTCTGCCGG GCGTCGTGAT CCTGCGCTCC GGCGGTGAAG CCACGAACGT GGCCATCCGC
GGCCTGCTGC CCAAATACAA CACGGTCACG GTCAACGGCG TGCGGTTGCC CGATACCGAC
CCCTCCAATC GCCAGGTCGA TCTGTCGCTG ATCTCGTCGA ACATCCTGGA CGGGATCGAG
GTGCGCAAGG CGATCACGCC GGACATGGAC GCCGACGCCG TGGGCGGCAA CATCGACCTG
CGGCTTCGGA GCGCGCCCTC GGGCTGGCAT TTCGACGTGC TGGCCACCGG TGGCTATGCC
GGGCTGCAGC AGTATCTGGG GAACTACAAG CTTGTCGGAA CAGCCAGCAA CCGCTTTCTG
AACGAGCGGC TGGGTGTGAT CGCCACGTTC AACGCGGATC GCTACAACCG GAGCGCCGAC
AAGCTGGGCA TCAACTGGAC GGCCGACGAC ATCAACCCGC TGACCGGCCA GCGTGAACCC
CGGTTCAACA GCTTCAACCT GCGGGAAGAG ACCGTCTTCC GCGGACGTCT GGGCGGCAGT
CTGCTGCTGG ACTATAACAT TCCCAACGGG CGCCTGCAGG GCAACATTTT CTACAACGAG
CTACGCGACG ATGTATTTGT GCGCCGTTAT GCCCCTTCGG TCGGCAGTCT GGATGCCAGT
GTGGAGGAGT ATGACACGCG CACGGCCATT CTCACCAGTG GGCTGGGGAT CGAGCAGGAC
CGGGGCAGCT TCAAATACGA CGCCCAGGCC TTTTATACCA TCTCCCGGCG TCGGTCGCCG
CACAACTACA TCTGGGAGTT CGGGCGCGAC GGCACGGCGC TGAGCGTGGG GCGGGCCGAG
CTGTTCGGCC TGTCGCCCGA CAGCGTCTGG AGTCTGGTGC GTCACGATTC GACCATGCAG
CTCTCCTCGA TCTGGGTGGA TTCCGAGCGC CTGGACGAGG ACCAGTACGG CATTCAGGCC
AATTTCCAGC GGCCGTTCCA TTTCGGATGG ATTTCCGGCT ACGTGAAGCT GGGCGGTAAG
CTGCGCTGGC TGTCGCGCAC GTTCGACCGT GAGCGCAACG GCCGGCAGGG CCTGCGGTAT
CCGAGCGATA CGGTCGAGCA GTGTCTGCTG GAGACGCTGG GCCCCGAGTG GGAGGAACGC
TATCAGATTG CCGACTCGGT CTATGGCGTG CCGGGGCTTC CGATCGCGCT CATCCAGAAG
GATTACAAGC GCGAAGGGGA GTTTGGCGAA GGACAGTTCG GGCTGGGACC GATCGCCGAC
GAAGACCTGC TCATGGAGCT GACGCGGGCG CTGCAGGCTT CGCCCTGCCA GGCCGAATAC
CAGAACAACA CGATCGAGTC GCTGGGCCGT GATTACGACG GCATCGAGCG GTATCAGGCC
GCCTATGTCA TGGCCCAGCT CAAGATCGGG CCCTACGTCA CGCTGATTCC GGGTATTCGC
TACGAGCGGG ACTACTCGCG CTATACGGGA CAGCGCTTCC GGGAAGTTAC GTCCGGTTTT
GTGTACGCGC CCCCGGCCGA TCTGGCCGAA CTGGAGGTGG AGCGCGAGAA CATCTTCTGG
CTGCCCATGG TCCATCTGGA CGTGCGGCCA CTCGACTGGC TGGCCCTCAA GCTGGCCCGC
ACCGAGACCA TCTCGCGGCC CAACTACTAT CAGTACGCGC CGATCACCTC GATCAACACG
TGGCGCTCGT TCATCTGGGC GGCCAACTCG AAGCTGCGTC CCTCGCATGC CACGAACTAC
GACGCCACGC TGCAGCTGGC CAGCTCCCGC TTCGGATTGT TCGGAATCTC GGCCTTCTAC
AAACGCATCG ACGATCTGCT GATCGAGGTC GAATTCCCCG CCCAGCTGTT CCGGGATGTC
AACGGCGATA CGGTGGTGAT CGGTGTGCCG GAAGGCACGA ACGTGCCGCG TGAGTGGCTG
GAAGGTGCCA GCCCGCAGCT CCAGACCACG GTCAACAACG ACGAACCGGC CAAATACTGG
GGCTACGAGC TGGAATGGCA GACCAACTTC TCCTATCTGC CCGGTGTGCT CAAAGGCCTC
GTGCTGAGCC TGAACTACAC GCGGGGCTTT TCGGAGACGA CCTATCACTA CTACCGCAAA
GAGCGGCAGA TTCTGCCGGG ACGTCCCCCG CGTACCATCT ACTCCATCAT CGATACGACC
CGCACCGGGC GCATGCCGGG ACAGGCGGCC CACGTGTTCA ACATGACCAT CGGCTTCGAT
TACCGCGGCT TTTCGGCGCG GCTGTCTTAC CTGTACCAGA GCGATATTGC CAGCTGGGTC
AACCCGCGTG AGCCGTTGAA CGATGTGTTC GTGGGGCCCT ATTCCCGATT CGATCTGTCG
GTGCGGCAGA AGATCGGGAC GGGAATGGAG CTCTATGCCA ACTTCAACAA CCTGAATAAT
CGACCCGACG AGCAATACAC CGGCCAGAAT ACGCAGGACC CGGATTATAG CTTTACGCGC
CGCTATCTGG CCTACAAAGA ACTTTACGGC TATACGATCG ACGTCGGATT CCGCTATCGA
TTCTGA
 
Protein sequence
MCARKRLTLK RWLALGLWLL LVGAAHGQVI RGTVTDRDTG DPLPGANVVI KGTVRGAATD 
VDGRFVIPNV APGQYTLVIS YLGYHTEEVP VTVAEGASEV EVNVQLVWEG IVGQEVVITA
QVAGQLAAIN EQFSDVVVKN VVSRDRILEL PDNNAAESIG RLPGVVILRS GGEATNVAIR
GLLPKYNTVT VNGVRLPDTD PSNRQVDLSL ISSNILDGIE VRKAITPDMD ADAVGGNIDL
RLRSAPSGWH FDVLATGGYA GLQQYLGNYK LVGTASNRFL NERLGVIATF NADRYNRSAD
KLGINWTADD INPLTGQREP RFNSFNLREE TVFRGRLGGS LLLDYNIPNG RLQGNIFYNE
LRDDVFVRRY APSVGSLDAS VEEYDTRTAI LTSGLGIEQD RGSFKYDAQA FYTISRRRSP
HNYIWEFGRD GTALSVGRAE LFGLSPDSVW SLVRHDSTMQ LSSIWVDSER LDEDQYGIQA
NFQRPFHFGW ISGYVKLGGK LRWLSRTFDR ERNGRQGLRY PSDTVEQCLL ETLGPEWEER
YQIADSVYGV PGLPIALIQK DYKREGEFGE GQFGLGPIAD EDLLMELTRA LQASPCQAEY
QNNTIESLGR DYDGIERYQA AYVMAQLKIG PYVTLIPGIR YERDYSRYTG QRFREVTSGF
VYAPPADLAE LEVERENIFW LPMVHLDVRP LDWLALKLAR TETISRPNYY QYAPITSINT
WRSFIWAANS KLRPSHATNY DATLQLASSR FGLFGISAFY KRIDDLLIEV EFPAQLFRDV
NGDTVVIGVP EGTNVPREWL EGASPQLQTT VNNDEPAKYW GYELEWQTNF SYLPGVLKGL
VLSLNYTRGF SETTYHYYRK ERQILPGRPP RTIYSIIDTT RTGRMPGQAA HVFNMTIGFD
YRGFSARLSY LYQSDIASWV NPREPLNDVF VGPYSRFDLS VRQKIGTGME LYANFNNLNN
RPDEQYTGQN TQDPDYSFTR RYLAYKELYG YTIDVGFRYR F
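
Consistency check

The coordinates, lengths, and sequences listed in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only: the function names (gene_length, protein_length, translate) are hypothetical and not part of any tool behind this page. It assumes inclusive 1-based coordinates, and it uses the standard codon assignments, which translation table 11 shares with table 1 for sense and stop codons (the tables differ in which start codons they allow).

```python
# Illustrative sketch; all names here are hypothetical helpers.

BASES = "TCAG"
# Amino acids in the conventional NCBI order (first base T, C, A, G;
# then second base; then third base). "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


if __name__ == "__main__":
    length = gene_length(2839523, 2842528)
    print(length)                  # 3006, matching "Gene Length"
    print(protein_length(length))  # 1001, matching "Protein Length"
    # First 30 bases of the gene sequence as listed above.
    print(translate("ATGTGTGCAC GGAAACGTCT GACACTGAAG"))  # MCARKRLTLK
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to translate should give the listed protein followed by a trailing "*" for the TGA stop codon.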