Gene Moth_0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0886 
Symbol 
ID3831524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp917207 
End bp918961 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content65% 
IMG OID637828816 
Producthypothetical protein 
Protein accessionYP_429746 
Protein GI83589737 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTG ATGGCCTTTT CCTGGCCGCC ATCAGCGCAG AACTATCCGG CCTGACAGGC 
AGCCGGGTGG ACCGCATCTT TCAACCGGAA AAGGAGACCG TTATCCTCCA CCTGCGTAAA
GGTCGCGACA CCAGGAAGCT GCTTCTTTGC AGCCTTTCCG ACCAGGCCCG GGTCCACCTG
ACGACGGCCA GTTTTACCAA TCCCCCCACC CCGCCCCTTT TCTGCATGGT CCTGCGCAAG
CACCTGGAAG GGGGTATTTT GACGGCCGTC GAGCAGCCGG GCCTGGAACG GGTGCTGAAA
CTCCACTTTA ACACCACCGA CGAGCTGGGA CGGCAGGCCC CCCGCTTGCT GTTAATTGAA
ATCATGGGTA AGCACTCCAA CATCATCCTG CTCAACCCGG AAGGTAGCAT CATCGACGCC
GCCCGCCGCT ACACCCATGC CGTCAGCCGC CACCGGGAGG TCCTGCCCGG CCGGCCCTAC
GTCCCGCCAC CGGCCCAGGA CAAGGCCGAT CCCCGAAAAC TCGACGACGA GGCCTTCACT
CGCCTCCTCT ATGAGGGTAA CTGGGGCGAT CCCCTGGAGC GTCTGCTGGT AAACAGGCTG
GCCGGCGTGG GGCCGGAAAC GGCCCGGGAG ATTATCCACC GCGCCGGCCT GCCGGCCGGG
ACGACCCTGG AGGGCTGCGG CGCGTATGAA GTGAACCGCC TCTACCAGGC CCTGGGGGAG
GTGCTGGCGG CCACCGGCCC CGCCGCCTGG AAGCCGGAGG TCATCCTCCG GCCGGAGGGG
GAACCCCTGG CCTTCGCCTC CTTTGAGCTC CACCAGTACC AGGGTTTGCC CCGGGAGCAC
CCGGCCACTC CGGGCGCCGC CTGCGATTAC TTCTACTCCC TCCGCCGGGA GCACCAGCTC
CTGGAAGGTA CCCGGCGAAG CCTGGAGCAT GTCCTGGAAA AGGAGTTAAA GCGCTGCCGC
AAAAAGGAGG GCCTCCAGGC CGCCACCGTA GCCGAAGCCG CCGGGGCGGA GGAGTTCCGC
CTGGCCGGGG AGCTCATCAC CGCCAATATC TACCGCATTA AAAAGGGCCA GGCCAGCCTG
ACGGCGGCCA ATTTCTACGA CCCGGACGGC GAGCCCGTTA CCATCGAGCT CGACCCTTCC
CGTACCCCGG CGGAAAACGC CCAGTGGTAC TTCAACCGCT ACAACAAGGC CAAGCACGCC
GCCCGCCTGG CAGCCGCCCA GCTGGAACAA ACCAAGGCTG AAATAGCCTA CCTGGAGAGC
ATCGCCCAGG CCGTCAGCAT GGCCGCCACC AGGGACGACC TGGAAGAGAT CCGCCGGGAA
TTGCGCCAGG CCGGTTACCT GCCTGAGGAA AGGGACAAAC AAAAACCCGG TAAAAAGGCC
GCTAAACCAG AAGCGCATCA GCCCTCCCGG CCCCTGGAGT TTACTTCCCC GGATGGTTTC
AAGATCCTGG TGGGCAAAAA CAACCGCCAG AACGACTGGC TGACCCTGAA ACAGGCCGCG
GATGGCGACC TCTGGCTCCA CGCCAAGGAT ATCCCAGGTT CCCACGTAAT TATCCGCACC
GGGGGCCGGG AGGTGCCCGC TACCACCCTG GAAACGGCCG CCCGCCTGGC GGCCCGCTAC
AGCCGCGCCG GCCAGTCCAG CCGGGTGCCG GTAGATTACA CCCTGGTGAA ACACGTCCGG
AAGCCCCCCG GCGCCAGACC GGGAATGGTC ATCTACGACC ACCAGCGGAC GGTTTATGTC
ACGCCGGCGG AGTAG
 
Protein sequence
MAFDGLFLAA ISAELSGLTG SRVDRIFQPE KETVILHLRK GRDTRKLLLC SLSDQARVHL 
TTASFTNPPT PPLFCMVLRK HLEGGILTAV EQPGLERVLK LHFNTTDELG RQAPRLLLIE
IMGKHSNIIL LNPEGSIIDA ARRYTHAVSR HREVLPGRPY VPPPAQDKAD PRKLDDEAFT
RLLYEGNWGD PLERLLVNRL AGVGPETARE IIHRAGLPAG TTLEGCGAYE VNRLYQALGE
VLAATGPAAW KPEVILRPEG EPLAFASFEL HQYQGLPREH PATPGAACDY FYSLRREHQL
LEGTRRSLEH VLEKELKRCR KKEGLQAATV AEAAGAEEFR LAGELITANI YRIKKGQASL
TAANFYDPDG EPVTIELDPS RTPAENAQWY FNRYNKAKHA ARLAAAQLEQ TKAEIAYLES
IAQAVSMAAT RDDLEEIRRE LRQAGYLPEE RDKQKPGKKA AKPEAHQPSR PLEFTSPDGF
KILVGKNNRQ NDWLTLKQAA DGDLWLHAKD IPGSHVIIRT GGREVPATTL ETAARLAARY
SRAGQSSRVP VDYTLVKHVR KPPGARPGMV IYDHQRTVYV TPAE