Gene Rmar_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1803 
Symbol 
ID8568455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2111746 
End bp2112867 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content68% 
IMG OID 
ProductPrephenate dehydrogenase 
Protein accessionYP_003291075 
Protein GI268317356 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.411636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAAC GCATTACGAT CTGCGGGCTG GGCCTGATCG GCGGCTCGCT GGCGATGGCC 
TGGAAGCGGG CGCGGCCGGA GCTGCATCTG ACGGCTTTCG ACCGGCGCGA GGTGCTTCGC
CAGGCCCGTG AGCTTGGCGT CGTGGACGCC ACTGCCGAAG ATGTGGCCGA GGCCGTCGCC
GAAGCCGATC TGGTGGTGCT GGCCGCGCCG CTGCGCGGCA TTCTGTACCT GCTGGAGGAG
ATCGGCCCGC ATCTGAAGCC GGGCGCCCGG GTGACCGACG TCTGTGGCGT CAAGCGCCCC
ATCATGGCGC ACGCCCGGGA GATGCTGCCC GAGACGGTCA CCTTCATCGG CGGGCACCCC
ATGGCCGGCT CCGAACGCCG CGGCCTGGCC AATGCCGATC CGTTTCTGTT CGAGAACGCC
ACCTACGTGC TCTGTCCGCC CCCCGGGAGC GACGCTGTCC GACTTCAGCA GGAGCACGAA
GACCTGCTGG AGCTGATCCG ACTGCTGGGT GCCCGCGTGC TGGTGCTCGA CGCCGAACGG
CACGACGCCA TCGCCGCCGC CGTCAGCCAC CTGCCCCAGC TTCTGGCCGT GCTGCTCGTC
AACACGGCCG CCGAACTCAG CAAAGGCGAC GAGACGTTCC TGCAGCTGGC CGCCGGGGGC
TTCCGCGACA TGACCCGGAT CGCATCGTCG CCGTTCGACC TCTGGCGCGA CGTGCTCTTT
GCCAACGAGG GGCCGCTGCT CGACACGCTC GGTCACTTTG CGGCCAACCT GCAACGCCTG
CGCAACCGCA TCATCGAAGA AGACGAGCAG GCACTCGCGG AAGCCTTTGA GCAGGCCCGC
CGGACGCGTG CCCGCATCCC GCGCGACACG AAAGGCTTCC TGCACCCGCT GGCCGACGTG
TACGTCCGCA TCGAGGATCG CCCCGGTGCT CTTTACCGGA TCACCCGCAC CCTCTACGAG
GCCGGCCTCA ACATTCAGGA CATCGAACTG CTGAAGGTGC GCGAGGGCAC AGGCGGTACC
TTCCGACTGG GCTTCGCCAC CGAGGCCGAC GCCGACCGCG CCTGCGAGGC GCTTCGCCAG
GCCGGCATCG AAGCCTTCCG TCCCGACGAT CACGGAAACT GA
 
Protein sequence
MIERITICGL GLIGGSLAMA WKRARPELHL TAFDRREVLR QARELGVVDA TAEDVAEAVA 
EADLVVLAAP LRGILYLLEE IGPHLKPGAR VTDVCGVKRP IMAHAREMLP ETVTFIGGHP
MAGSERRGLA NADPFLFENA TYVLCPPPGS DAVRLQQEHE DLLELIRLLG ARVLVLDAER
HDAIAAAVSH LPQLLAVLLV NTAAELSKGD ETFLQLAAGG FRDMTRIASS PFDLWRDVLF
ANEGPLLDTL GHFAANLQRL RNRIIEEDEQ ALAEAFEQAR RTRARIPRDT KGFLHPLADV
YVRIEDRPGA LYRITRTLYE AGLNIQDIEL LKVREGTGGT FRLGFATEAD ADRACEALRQ
AGIEAFRPDD HGN