Gene Moth_1929 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1929 
Symbol 
ID3830853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2001953 
End bp2003881 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content58% 
IMG OID637829861 
Producthypothetical protein 
Protein accessionYP_430771 
Protein GI83590762 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.943643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0112851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTGGT TTCTTCTGGG CACACTGGCC CTGCTCACCG GGTCCTGGCT CATGGCGGCC 
CGGATGGCGT TGAAAAAAAC TCTCGACCGC TGGCTGGCCG TAGGGGTATT GACCATGGCC
GGGCAGGTAC TGATCCTCCT GCTCGCCGGT CAAGCCCTGC GGCATCTTAA CCGCGGCGTG
GTATTGCTCC TGGCCCTGGG CTGGGCGGGG CTCGGGTTGC TGATCCGGAA CAGGCAGGGG
AGTAGATCCG GTGAGATTTT CTCCTTTGAA AGCGGGGCGG CCGGGGATAT AGGAGAACCC
GGGAACGCCG AGATTTTTTC CAGTTCCCTC ATCGCAGTAA CCGGTGTCAT TCTGGCCTTT
AGCCTGGCAG CCCTGGTCCT GGGGTGGATT TACCTGCCGC CCTTCGCCTG GGATGAGATC
TGGTATCACC TTACTCCTAT GGCCGCCTGG TTCAAACAGG GGGCCATTAC CCGGCTGCCG
GAGGCCCTAC TCTGGCAGCA CTACGATCCC ACCCGGGTAA CCTCCAGAAA GCTCGCCCTG
GATTTAAGCG TGGCCTTTAA TTGGGCCAAT GTTTACCCCC TTAACGCCGA ATTAAATGCC
CTATGGATCA TGGTCCTGAC GGGTAACGAC CTCCTGGTGG ATGCCACCCA GTTCCCTTAT
GTTGTAATCG GCGCCCTGGC CACCTTCGGC CTGAGCAGGT CAGGGGGTGC CGGACGGAAT
GCATCGATTC TGGCTTCTAT GCTCTTCCTT TTGACCCCCA TGGTTCTGAT CCACCTGCGG
GTAGCCTATG TCGACGCCGC CTTCGGCTCT ATGGTAGCCG CCGCCCTTTA TCTTTTTTTA
AGGTGGCAGG AGGAGCGCAA TCTGGAGTAT GCCCTGATCC TGGGCCTGGC CATTGGGCTG
ATGATGGGGA TTAAAGCTAC GGGCATCGCC TTTGCCGGTG TCTTCGTCCT GGCTATACTG
GCCTCAGGCT TCTGGCAGTA CCGGCAAGGG GTATCCGGCA ACAGAGCTCT CTGGCTCCAG
ATAACAGTGA TTTTATTGTC CCTGATGGCC ACGGGCACTT TTTGGTACTT GCGAACCTGG
TGGTTTTACG GTAATCCCGT TTACCCCGTA GAGATTCAAG TGCTGGGATG GAAGCTTCCC
GGCATGGGCA GTGTCTCCAG GTTATTTATG GCCGCGAATA CACCGGAAGC CTACCGGGGC
CGCAACATCA TCCTCAATAT CCTTACTTCC TGGTTGGAGC TTGGTAACGA ATCCTACAAT
TATTATTCCC GCACCCGCGG CCTGGGACCG GCCTGGGCGG CCCTGGCCCT CCCCGCTATC
CTACCCTTCG CATTTTCAGC CTGGCGCCGG CGCCGGGCCC CGGTACTCTG GATGGTGGGC
CTGACCCTTA TTTTTTTAAT CCTCCAGCCG GCTGCCTGGT GGCCGCGCTA TGTCCTTTAT
GTAGTGCCGG TGGGACTGGC GGCTATGGCC TGGGTTTATG ACCGCCTCGA CCCCAGGTTG
AAAATGGCTG TGGCCGGAAT CATGATTCTG AACCTGGTGG TGGCCACCGG TTTGACTCTG
GTAGAAACCC TGGATAAGCT ACCTGCAGCC ATGCGCCTGG ACCCAGCTCA CCGGACCTTT
GGGGAACTGT ACTTCAGTGA CTATGCCTGG GTGGACCAGC TGCCTCCCAG CCGGATTGGT
TATACGCCTA TGGCCTGGAT CTACCCCCTG TATGGCGGCC TGCGTAACCA GGTGGAGCTG
GTCGATGGAA CCACGGCGGA GTCCTGGCGG GAGGCCATCA TCTGTCAGGG CCTGGATTTT
GTTGTTACCA ACACCCAGTA TGGCGACTAC GATAGATGGG CCCTTTCACT GCCTGATTTA
TTAACACCCT ACAGGAAAGG GGAGATGATT AATGTCTACC GGGTCAGGCC CGGCCGGGGA
CGACAATGA
 
Protein sequence
MLWFLLGTLA LLTGSWLMAA RMALKKTLDR WLAVGVLTMA GQVLILLLAG QALRHLNRGV 
VLLLALGWAG LGLLIRNRQG SRSGEIFSFE SGAAGDIGEP GNAEIFSSSL IAVTGVILAF
SLAALVLGWI YLPPFAWDEI WYHLTPMAAW FKQGAITRLP EALLWQHYDP TRVTSRKLAL
DLSVAFNWAN VYPLNAELNA LWIMVLTGND LLVDATQFPY VVIGALATFG LSRSGGAGRN
ASILASMLFL LTPMVLIHLR VAYVDAAFGS MVAAALYLFL RWQEERNLEY ALILGLAIGL
MMGIKATGIA FAGVFVLAIL ASGFWQYRQG VSGNRALWLQ ITVILLSLMA TGTFWYLRTW
WFYGNPVYPV EIQVLGWKLP GMGSVSRLFM AANTPEAYRG RNIILNILTS WLELGNESYN
YYSRTRGLGP AWAALALPAI LPFAFSAWRR RRAPVLWMVG LTLIFLILQP AAWWPRYVLY
VVPVGLAAMA WVYDRLDPRL KMAVAGIMIL NLVVATGLTL VETLDKLPAA MRLDPAHRTF
GELYFSDYAW VDQLPPSRIG YTPMAWIYPL YGGLRNQVEL VDGTTAESWR EAIICQGLDF
VVTNTQYGDY DRWALSLPDL LTPYRKGEMI NVYRVRPGRG RQ