Gene Moth_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1917 
Symbol 
ID3830841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1987003 
End bp1988676 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content50% 
IMG OID637829850 
Producthypothetical protein 
Protein accessionYP_430760 
Protein GI83590751 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00808951 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGTT ATCAGGAGAT ACTTAAGGAC GCGCCGCTGG ATTTCCTACA GAAGCTGGCC 
GGGAATCTGA ATTTGGCGAC AAAGGGTGGA AAAAGAGCAG GCAGGGGTTC AGAGGACGGC
TGGCAGGCAT TATACAATAA ATTGGTGGCG TATTATAGTT CCCCGGACAA CCTGGAAACC
CTCTGGCAGA AAATCGGGCC TTCAGGCCAA CTGGCGCTGG AAACAATCCA TTTTAGCGAA
CCTTATTACG ATGAGGTGAG CAGGGTCCAT GAGAGGCTCA ACAAAATCCT CGGAAAAAGG
GCGGCAAGGG AGGCGCGGGA ACTTCTCCTG GGTTGGGGCC TGATATTTTT AAGTGAAAAC
GAATACGGAC TGGATTATTA CGATCTACCT CTAGAAGTAC GTAAATTTAT CAACTATAAA
GTACTTCCCC TGCTGGTAAA AAAAGACGGC ATCCCGCTAC CGGAACAAAG GGAAAACCAC
GGTCTCTTTT TCTGGCTGGA TTTTTATATT TTACTGGCCG GGGTCCTGCA ACGCGAGGTC
AGGGTGACCC AGACCGAGCG TGTTTTTTAT AAAAGGGACC GCAAGAAGAT CATGCTCTGC
CAGCACTACC CTGATGACGA AAGCCGCTAC CTTTTGCTGG AAGAGGTGGC CTGGGCGAAT
GACTTTTTAG TTGAAAAAAA TGGTTGCGCC CGGTTGAGCG CGAAAACTTG GCAATGGTTG
CAGCTACCGC GCTATAAACA ATGGCTTACA TTTGTCGACT GGGTAATAAA CCGTTATTTC
CAATGCCGTA ATATTTGGGC CACCCATATC CTGGGATTTC TGTTAACATT ACCGCCGGAG
AAATGGCTCT CCTTGCCAGC CCTTTACCAG CTAATCCATA AGTATAACAC GCCCTCCCAG
GCGAATTATA TCCTGGCGAA TAACAAGGAG TTGTTGCAAA GATTTCTCTG GCTGGGCCTG
ATTGAGGTTG CCGGCGGTTT GGAACAAGGA TGCATCAGGA TAACCGATCT TTTCCGCCGC
TACTTTAATG TTTTGCTCCA GCACGACCAA GAAGTAGAAG AAACGGACGG GGAGGTTTTC
AGGGAGGCGA TAGAGGGTTT CTTTCCGGAA GCATCCAGTT TTATCGTGCA ACCCAATTTT
GAAGTCATCG CCCCCATGGA ACTCTCCCCC AATCTGTTCA TGCAGCTAAG CACCTTTACC
GATCTAGTCA GCGCCGACCG CATGTTTATT TTCAGCCTCA ATGAGAAAGC ATTTTACCGG
GGTTTTACCA GGGGCCGGCA ACCGGAAGAG ATGCTAAAAT TCCTGCAGGA ACACAGCAAG
TATGAATTAC CGCCCAATGT TCTTACAACG GTGGAGGAAT GGGCGGCAAA AATGGGGAAG
GTCTCCTTAG TGAAAGGGGT GCTGGTGCGT TGCCAGAAGG AAGAACTGGC GGAACAGGTG
AAAGCCCTGC TGGAAGCCAG GGGGTGGCTG ATCGAGGCCA TAACGCCGCA GGTCTTCCTG
GTGCCGGAGA ATAGGGGTGA AGAATGCCTG GAGTTGCTGG AGAAACAGGG CTTTATGCCC
CATCCCCAGC TGATTGCTTT GCGGGCGGGA GGGGAGGACG ACGTCGACCT CGATGAAAGC
CCGAATACCT TGCTGGCCCG GTTTATAGAG GCTGCCTTAA AAAAAAGAGG GTAA
 
Protein sequence
MDSYQEILKD APLDFLQKLA GNLNLATKGG KRAGRGSEDG WQALYNKLVA YYSSPDNLET 
LWQKIGPSGQ LALETIHFSE PYYDEVSRVH ERLNKILGKR AAREARELLL GWGLIFLSEN
EYGLDYYDLP LEVRKFINYK VLPLLVKKDG IPLPEQRENH GLFFWLDFYI LLAGVLQREV
RVTQTERVFY KRDRKKIMLC QHYPDDESRY LLLEEVAWAN DFLVEKNGCA RLSAKTWQWL
QLPRYKQWLT FVDWVINRYF QCRNIWATHI LGFLLTLPPE KWLSLPALYQ LIHKYNTPSQ
ANYILANNKE LLQRFLWLGL IEVAGGLEQG CIRITDLFRR YFNVLLQHDQ EVEETDGEVF
REAIEGFFPE ASSFIVQPNF EVIAPMELSP NLFMQLSTFT DLVSADRMFI FSLNEKAFYR
GFTRGRQPEE MLKFLQEHSK YELPPNVLTT VEEWAAKMGK VSLVKGVLVR CQKEELAEQV
KALLEARGWL IEAITPQVFL VPENRGEECL ELLEKQGFMP HPQLIALRAG GEDDVDLDES
PNTLLARFIE AALKKRG