Gene Moth_2415 details


Gene Information

Locus tag: Moth_2415
Symbol:
ID: 3832166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2537446
End bp: 2539083
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 62%
IMG OID: 637830334
Product: hypothetical protein
Protein accession: YP_431240
Protein GI: 83591231
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:
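
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 2537446, 2539083

length_bp = gene_length(start_bp, end_bp)
protein_aa = expected_protein_length(length_bp)

print(length_bp, protein_aa)  # 1638 545
```

Under these assumptions the computed values agree with the listed Gene Length (1638 bp) and Protein Length (545 aa).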


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0766121
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACGAGGC ACTACCGTTA CCTGCAGCGC TATACTGAAG TGGCTACCGT ACTTCTCCGC 
CACGGCTGGC AGGCTTACTG CGAAACCCGC CCCCGCCGCG CTCCGGCCCG GCGCCCCCTG
GTTACAGGTC GGGAGCCAGG CGATCCGGTT TACCGGCACC TGCGCCTGGC CTTTGAGGAA
CTGGGACCGG TTTTTATCAA GCTGGGCCAG CTCCTGAGTA CCAGGCCCGA CCTGATACCG
ACGGAGATGG CTGCGGAATT CAGCTACCTC CAGGACCGGG TACGGCCCCT GGCCCCGGAC
GTCATCCGGC AGCAGGTCTT CCGGGAACTG GGGACTGCTC CGGAAAAGGC CTTCAGTTAC
TTTGATTACC AGCCCCTCGC GGCGGCATCC ATCGCCCAGG TCCATAAAGC CCGGTTACCT
GGTGGCCAGG AGGTGGCCGT CAAGGTGCAG CGCCCCCAGC TCGATGGGGT GGTCGTTACC
GACCTGGCCG TCCTGGAGAA TCTCGGCCGG AGATTCAAGG GCACCGTCGT CGGCCGTATT
TGTGCCTTGG AGGAGATCCT GGCCACCTTT CGCCGCCAGA TTGAACGGGA ACTCGATTTT
ACCGTGGAAG CCCTGGCCAT GGAGAATTTT CGCCGCCTGT ACCGTGAGTT TCCGCAGATA
GTGGTCCCCA GGGTTTACTG GGATTATACG ACCAGGGGCC TCCTGACCAT GGATTACCTG
GCCGGGAAAA GGCTCAGCGA CTGGTACGGG AAGGGTACGG ACTGTCAGCG GGCAGCCCTG
CTTATCAAGG CGCTCCTGGC GCCTTTTTTC CAGGAAGGCA TCTTCCACGG CGACCCCCAT
CCGGGAAACA TTCTCTTTCT TCCCGGCGGT CGCCTGGGCT TAATTGATTT TGGCATCGTC
GGTCGCCTGG ATGAGGATTA TCGTTACCAG GCTGCCAGGC TGATTCTAGG CCTCCAGGAA
CGCGATTTAC AGGCCGTAAT GGAAGTAACC CTGAAACTGG GTAAGCCCAT GGCCGCAGTA
GATTACCAGG CCCTCTATGA AGACACGGCA GAACTGGTTG ACCGGGTGAC CGGCATGGGC
AAAGGGGATG TCAATCTGGC CGGTCTCCTG CTGGGAATGG TGGAACTGGC CCGCCGCCAT
AGCATCCGTA TGCCCGGTAC CTTCTTCGTC CTGGGGCGGA CGATTATGGA AGGGGAGAGC
CTGGCCCGCC GCCTGGATCC TTCCCTGGAT CTGGTGCAGG TAAGCGGGCC CCTGGCTGCC
AGTTACCTGC GCAGCCGCCT GCGTCCCAAC CCCACGCCCG GGCGAACCTA CCACCGGGCG
GCCTCAACCC TGCAGGATTT GCTGGAACTG CCGCGGGATA TCTCCCGCAG CCTGGATAAA
CTTGCCCGGG GGCAGTTAAC TACCATTTTT GTCCACCGGG GCCTGGAAAC CCTTTACCAC
AGACTGGATA TGGTTTCCGC CCGGCTTTCT GCCGCTCTCA TCGTGGCTGC CCTCATCGGC
GCCGGGGCCC TAATCCTCCA CGCGGGTGCC GGTCCTAAAA CCGGTGGCCT TTCCCTCCTC
GGTCTGGGAG TGCTGGGCGG CGCCCTTATC CTGGGTTGTC TCTGGGCTCT GCTCCTCAAG
GTAGGACAGA AGGAATAG
 
Protein sequence
MTRHYRYLQR YTEVATVLLR HGWQAYCETR PRRAPARRPL VTGREPGDPV YRHLRLAFEE 
LGPVFIKLGQ LLSTRPDLIP TEMAAEFSYL QDRVRPLAPD VIRQQVFREL GTAPEKAFSY
FDYQPLAAAS IAQVHKARLP GGQEVAVKVQ RPQLDGVVVT DLAVLENLGR RFKGTVVGRI
CALEEILATF RRQIERELDF TVEALAMENF RRLYREFPQI VVPRVYWDYT TRGLLTMDYL
AGKRLSDWYG KGTDCQRAAL LIKALLAPFF QEGIFHGDPH PGNILFLPGG RLGLIDFGIV
GRLDEDYRYQ AARLILGLQE RDLQAVMEVT LKLGKPMAAV DYQALYEDTA ELVDRVTGMG
KGDVNLAGLL LGMVELARRH SIRMPGTFFV LGRTIMEGES LARRLDPSLD LVQVSGPLAA
SYLRSRLRPN PTPGRTYHRA ASTLQDLLEL PRDISRSLDK LARGQLTTIF VHRGLETLYH
RLDMVSARLS AALIVAALIG AGALILHAGA GPKTGGLSLL GLGVLGGALI LGCLWALLLK
VGQKE