Gene Rmar_0472 details

Gene Information

Locus tag: Rmar_0472
Symbol:
ID: 8567106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 518649
End bp: 520697
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: transketolase
Protein accession: YP_003289762
Protein GI: 268316043
COG category:
COG ID:
TIGRFAM ID:
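
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the CDS is complete with a single terminal stop codon. The function and variable names are hypothetical, not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 518649
end_bp = 520697
gene_length_bp = 2049
protein_length_aa = 682


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should agree with the values reported in the table.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```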


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGAAC AGAAGGCGAC GGAGGCGATT CGGGAAACCG ACTTTTCGCC GGGTGAACTG 
GACAATCTGT GCATCAATAC GATTCGATTT CTGGCCGTCG ATGCCGTCGA GCAGGCGAAG
TCCGGCCATC CAGGCATGCC GATGGGCGCG GCGCCAATGG CCTACGTGCT CTGGACCCGG
CATCTGCGGC ACAATCCGCG CGATCCGAAG TGGCCGAACC GGGATCGGTT CGTGCTTTCG
GCCGGTCATG GCTCGATGTT GCTCTACGCG CTGCTGCACC TGACCGGCTA CGACCTGCCC
ATGGAGGAGT TGCAGCGCTT TCGTCAGTGG GGATCGCGCA CGCCGGGCCA TCCGGAATAC
GGGTTAACGC CCGGGGTCGA GACCACGACG GGACCGCTGG GCCAGGGCTT CGGAAATGCC
GTCGGGATGG CGATCGCCGA GCAGTATCTG GCGGCGCATT TCAACCGGGA CGGATTCCCG
CTGTTCGACC ATTTCACGTA CGTGATCGCC TCCGACGGCG ATCTGATGGA GGGCATCTCG
CACGAGGCGG CTTCGCTGGC CGGACATCTC GGGCTGGGGA AGTTGATCGT GCTCTACGAT
GACAACGACA TTTCCATCGA CGGCTCGACG GACATCACCT TCACCGAGGA CGTCGGCGCG
CGCTTTGAGG CCTACGGCTG GCATGTGCAG CGCGTGGACG ACGGGAACGA CCTGGTAGCC
ATCGATGCGG CGCTTCGGCA GGCCAAAGCC GAGACCGAAC GCCCTTCGCT CATCATCGTG
CGCACGCACA TCGGCTACGG AAGCCCGAAC AAGCAGGACA CGCCCGCCGC GCACGGCGCG
CCGCTCGGAC CCGAAGAGGT GCGCCTGACC AAGCGCAACC TGGGCTGGCC CGAGGACAGG
ACCTTCTACG TGCCGGACGA AGTCTACCGG CACATGCGGC AGGCCGTCAC GCGGGGGCAG
CAGTGGCAGG CCGAATGGGA GGCGCTGCGC GCCCGCTATC GGGAGGCCTA TCCCGCCGAA
GCCGCTGAAC TGGACCGCTG GCTGAGCCGG CGGTTGCCCG AAGGGTGGAG CGAGGGACTC
CCGACCTTCG AGGCGGGCAA GGCCGTGGCC ACGCGTAATG CCGGCGGCGC CGTGCTCGAC
GTACTGGCCG CCCGTATTCC CGAGCTGATC GGCGGCTCGG CCGACCTGGC CGAGTCGAAC
AAGACGCATC CGAAAGGGCG CGAGGCCTTC AGCCGCGACA ACCGCAAGGG CGGCTACATC
CATTTCGGGG TGCGCGAGCA TGCCATGGCG GCCATCTGCA ACGGGCTGTC GCTGCACGGA
CTGCGGGCCT ACGCGAGCAC CTTTCTGGTC TTCAGCGATT ATCTGCGGCC GTCGCTGCGG
CTGAGCGCCC TCATGGAGCA GCCGGTCATC TACGTGTTCA CGCACGACTC GATCGGGCTG
GGCGAGGACG GGCCCACGCA TCAGCCGATC GAGCATCTGG CCAGCCTGCG CGCCATCCCG
CACGTGGTGG TGCTGCGGCC GGCCGACGCG ACCGAGACGG TGGAAGCCTG GAAGGTGGCG
CTCGAGCGCG AGGACGGTCC CACGGTGCTC GTACTGACAC GTCAGAACGT GCCGGTGCTG
GACCGGAGCC GCCTGGCACC GGCCGATGGG GTGCGCCGTG GCGCTTACGT GCTCAAAGAA
GCACAGGGCG CGCTGCAGGC GATCCTGCTG GCTTCGGGCA GTGAAGTGCA TGTGGCGCTG
GCGGCCGCCG AACAGCTCGA AGCCGAAGGC ATCGGCACGC GCGTCGTCAG CGTGCCTTCC
TGGGAGCTGT TCAAAAAGCA GGAGGCGGCC TATCGGGAAT CGGTACTCCC GCCGGAGGTG
ACCGTGCGGG TGGCCGTCGA AGCCGGGGTC GGACAGGGAT GGGAGCAGTT CGTGGGGTGC
CGGGGCCGCA TCGTCAGCAT CGAGCGCTTC GGCGCTTCGG CCCCCGGCAA GGTCCTGTTC
GAAAAATTCG GCTTCACGCC CGAGCGGGTG GCCAGTGAAG TGCGTGCGCT GCTGGCGCAG
AACCATTGA
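
The gene sequence above is printed in blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the only assumption is that whitespace is the sole formatting in the pasted sequence. The result for the full sequence can be compared with the GC content reported in the Gene Information table.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first line is shown here, so the printed value is not the gene-wide figure.
blocked = """
ATGCAGGAAC AGAAGGCGAC GGAGGCGATT CGGGAAACCG ACTTTTCGCC GGGTGAACTG
"""
seq = clean_sequence(blocked)
print(len(seq), round(100 * gc_fraction(seq)))
```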
 
Protein sequence
MQEQKATEAI RETDFSPGEL DNLCINTIRF LAVDAVEQAK SGHPGMPMGA APMAYVLWTR 
HLRHNPRDPK WPNRDRFVLS AGHGSMLLYA LLHLTGYDLP MEELQRFRQW GSRTPGHPEY
GLTPGVETTT GPLGQGFGNA VGMAIAEQYL AAHFNRDGFP LFDHFTYVIA SDGDLMEGIS
HEAASLAGHL GLGKLIVLYD DNDISIDGST DITFTEDVGA RFEAYGWHVQ RVDDGNDLVA
IDAALRQAKA ETERPSLIIV RTHIGYGSPN KQDTPAAHGA PLGPEEVRLT KRNLGWPEDR
TFYVPDEVYR HMRQAVTRGQ QWQAEWEALR ARYREAYPAE AAELDRWLSR RLPEGWSEGL
PTFEAGKAVA TRNAGGAVLD VLAARIPELI GGSADLAESN KTHPKGREAF SRDNRKGGYI
HFGVREHAMA AICNGLSLHG LRAYASTFLV FSDYLRPSLR LSALMEQPVI YVFTHDSIGL
GEDGPTHQPI EHLASLRAIP HVVVLRPADA TETVEAWKVA LEREDGPTVL VLTRQNVPVL
DRSRLAPADG VRRGAYVLKE AQGALQAILL ASGSEVHVAL AAAEQLEAEG IGTRVVSVPS
WELFKKQEAA YRESVLPPEV TVRVAVEAGV GQGWEQFVGC RGRIVSIERF GASAPGKVLF
EKFGFTPERV ASEVRALLAQ NH
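
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that without external libraries. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The names `CODON_TABLE` and `translate` are illustrative, and unknown codons are mapped to "X" as an arbitrary choice.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for codons with non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGCAGGAAC AGAAGGCGAC GGAGGCGATT"))  # MQEQKATEAI
print(translate("AACCATTGA"))                         # NH
```

Applied to the full gene sequence, the output can be compared with the protein sequence after removing its block spacing.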