Gene Mrub_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMrub_1841 
Symbol 
ID8879949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMeiothermus ruber DSM 1279 
KingdomBacteria 
Replicon accessionNC_013946 
Strand
Start bp1896492 
End bp1898882 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content65% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003507618 
Protein GI291296220 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTAT TGCTGGGGTT GTTTAGATTT TGGCTTCACC TGTCCACTCT ACTTTTACTG 
GCACTGGCAG GCTGCACCCG CCCTGCGCTG GAACCCGCCC TACAGATCAA CTTTTCCCCA
GACCGCCTGC GCCTTGTGCC GGGCGGGGCC GGAGAGGTCA ACATTACCCT TGTGCGCCAG
AATCTGCCTG GAGTGGTAGC ACTGGATCTG CCTGACCCTT TACCAGAAGG CGTAACCGCC
TCTTTTACCC CTTCCAGCAC AACCGGCGAC ACCGCCACCC TACGGGTTCA GACCTCGAGC
AGTGCAGCCG TGGGCAGCTT TAACTTGCGG GTAAGGGCTG CCCAGGGATC CATAAGCGCC
ACCAACGACC TACCCTTGCA GATAGAACCC AGCCAGCAAC CCGACTTGCA GTTATTCCTG
GACAACCCCA CCCCCTCCAT CCGCCAGGGG CAGCATGCAG ATCTTTTAGT TGACGTGCGG
CGCATCAACC TCGAGGGCGT GGTGGTACTG ACCCTCGAGC AGCAGGACGG CCGCCCCCTG
CCGCCCGGCC TGAGCGCGAC CTTCAGCCCC GGCCAGCCCA GCACCACCGT CTCGATTCTG
CGGCTTGCAG CGGCCCCCAC CGCTACCACA GGCCCCTACC CCCTGCGCAT CAAGGCCACC
TGGGGCAGCC TCGAGCGCAC CCTGGACTTC ACCCTCACAG TCCTGGAAGC CCTGCCAGAG
CCCGACTTCC AGCTCAACCT GACCCAGAAT CTCAGCCTTC AGCGGGGCGG AACCAGCAGC
GAGACCATCA CAATTACTCG TGTCAATTTG TCCGGCCCCA TCGCCCTGAG CCTCGAGCGC
TTCGACGGCA CCCCGCTACC CCCTGGCATC AGCGCAACCT TTGCACCAGC AGAACCAGAG
GGCCCCCATT CCACCCTTAC CATCTCCGCT GCCCCCGACC TGCCCCTGGG AGACTATGTG
CTACGGGTGC GGGGCGTGCA AGGCACGCTC GAGCGCACCG CTCTGGTACT GCTAAATGTG
TTTGATCAGG CCGGGCTTAC CGCTACCGGA GCCGTCTGGG TTGCCGCCCA GGACGATAGT
GGAGCCTGGC AGGTAGTACA ACCCACCGCC GGCAGCTACC GCCTGCGCGT CAGCAACGCC
GCAGAACGCT ATGGCTGGGC GGTGGTGTGC AGCAAAACCG AGGCCGGCCT CACCACCCAT
CAGGTGAGCG TCTATCAGCT CACCCTAAGC GAGGTGCGCA CGCTGAGCCT GTCCTGCCCC
CCGGCTGCCA GCAGCGGCGC CTTCTCAGAC CTCAGTGGAC AGCTCACCAA CCTGGATGGC
CGTCATGCCC AGGTGGCCTT TGGCACGGCC AGCGACTTTG TGGATCCGGC CCGAACAGCC
GACTTCCCCC CCACCCCAGC CTACCCCGGT TACCTGCTGC AAGGCGTGCG GCAGGGCACC
GCCGATCTGA TGGCCGTGCG CTACCTACCA CCCACCCCAC CAGGAACCTA CTTCCAGGCC
GACCGGGCCC TGTTCCAGCG CAGCTACACC CTGAGCGGTC GGCAGAGCCT CGACCTGGAT
ATGCAAGGTT CTGCCTCCTT CGCGCTCGAG GGCACCTACA CCGCCACCCT GACCAACCCC
AACCCCGCCG CCCAGGGCCT GAGCTACCTG GCCTACCTGA CCCCCACCAC CCAGACCCTT
TACCTGGCGG ACAGCCAGCA ACAGGCCGCC GATCTATCCT ACGGCGCCAT CCCCACCAGC
CGGCGGCTGG CCAACGAGTT CTATGTTTTC TATGCCCGCG AGACCACCTT CAGCAACCTG
AGCCTGCGTT CCCGGCAGGC GCTGCGGGGC TTTGCCAACC CCCAGAACCT GAGCGCCAGC
TTCCTGAGCC TGCCGGAAGC CCATCTGAAC CTGCAAAACA ACCGCTTCCA GACCACCTGG
AGCCCGTATT CGTGGTCGGG CAGCGGCAGC CAACTGTTCA GCCTGAAGCT CGAGCAGTTG
GCGGTGGCCC CCAGCACCAA CCTGGTGTGG CACCTCCACC TTTCGCGCCG CTGGGTGGGT
GGTACCACCA GCTACCCCAT TCCCAACCTC GCCCAGAGCT GTGCGGCCGC CCAATCCCCT
TGTGTTCCGG CTCCTTCCAA CACCGCCACC AACGGCTGGC AGAGTGACTG GAGTTTGCGC
GATAACCTCG AGCTTGACTG GTCGTTTAGC GCGGTGCAGG TGAGCCTTCC CCTAACCGAC
TGGCTGCCCC TGGCCCCGAG CCCCTTCCCA CCCACCAGCC TCAACCTGGA GGGCTTCAGC
TTTGATGCTG CCAGCGCTGG GGGCCTTTAC AACGCCAGCA GGCTCTCGGC GCAGCACCAG
GTAAAAGCCA GGGAGCCGCG CTTACTGCCC TTCTGGCTTG CACCCCGCTG A
 
Protein sequence
MRLLLGLFRF WLHLSTLLLL ALAGCTRPAL EPALQINFSP DRLRLVPGGA GEVNITLVRQ 
NLPGVVALDL PDPLPEGVTA SFTPSSTTGD TATLRVQTSS SAAVGSFNLR VRAAQGSISA
TNDLPLQIEP SQQPDLQLFL DNPTPSIRQG QHADLLVDVR RINLEGVVVL TLEQQDGRPL
PPGLSATFSP GQPSTTVSIL RLAAAPTATT GPYPLRIKAT WGSLERTLDF TLTVLEALPE
PDFQLNLTQN LSLQRGGTSS ETITITRVNL SGPIALSLER FDGTPLPPGI SATFAPAEPE
GPHSTLTISA APDLPLGDYV LRVRGVQGTL ERTALVLLNV FDQAGLTATG AVWVAAQDDS
GAWQVVQPTA GSYRLRVSNA AERYGWAVVC SKTEAGLTTH QVSVYQLTLS EVRTLSLSCP
PAASSGAFSD LSGQLTNLDG RHAQVAFGTA SDFVDPARTA DFPPTPAYPG YLLQGVRQGT
ADLMAVRYLP PTPPGTYFQA DRALFQRSYT LSGRQSLDLD MQGSASFALE GTYTATLTNP
NPAAQGLSYL AYLTPTTQTL YLADSQQQAA DLSYGAIPTS RRLANEFYVF YARETTFSNL
SLRSRQALRG FANPQNLSAS FLSLPEAHLN LQNNRFQTTW SPYSWSGSGS QLFSLKLEQL
AVAPSTNLVW HLHLSRRWVG GTTSYPIPNL AQSCAAAQSP CVPAPSNTAT NGWQSDWSLR
DNLELDWSFS AVQVSLPLTD WLPLAPSPFP PTSLNLEGFS FDAASAGGLY NASRLSAQHQ
VKAREPRLLP FWLAPR