Gene Moth_1765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1765 
Symbol 
ID3831057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1820654 
End bp1822066 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content60% 
IMG OID637829690 
Producthypothetical protein 
Protein accessionYP_430609 
Protein GI83590600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA TCCTGCCGGA ATTTTTACGC TGGCAGCCGG GGGCGGAGCA GTGGGCCGTC 
CCGGTCAGCG CCTGGGATAC CGGCCCCACC AGCTACCTGA GGTTGCGGCG GGTATTGCTG
GACGGCCGCC CGGTCGCCGG GCGTAATATT ATCTTTCCTG GGGTAATGCC GGTTTACCTC
CTGCCTGCCG GGGCCCGGAC CTCCGACCCG GCGGCTTACT TAGCCACCCT GGGAAGAGAA
GATAAGCGCT ACGCCTGTTC GTGGTTGATC CTGAAGACGG CTGGTCTGTC CGCTCTGCAG
GTTACCGGCG GTGAACACTC CCTGGAACTG GAGTTTATCA CCTTTGCAGG TATGACCTGC
CGCGCCGCCA CCACCATCCT GCTGGCACCC CTGCCAGCCC ATCCCCACTG GCAACCGGTG
GAGCTGCAAA TGCATAACTC CAGCCACGAC GACGGCCACT GGTCGCCGGC AGCAGTAGTA
AAGGAACTTG TTGGGCGAGG CTACCGGGCC CTGTATTTCA CTCCCCACGC TGATTTGATC
GCCGGTTTCT GGGAAGAGTT CGCCGGCCTC TGCCGGGATT TATCAGGCAC CATTGCCGTC
TTCCCGGGCC TGGAACTAGC TACCAGGAAT AGCGCCGGGC ACCTCCTTAT TTATGGCTTG
ACGGCCCTAG AAGGCTGGCA GAATGCCAGG AACCCCGGCC AGGTGATTAT TGACCGGGTT
AACCGCTTAC CTGGCCATAC CGCCTCGGTA ACCGTATCCC ACCCCTTCGG TCGGCCTCCC
TGGCCCTGGG AGGACGAGCC GGTGGTGGAT TACAGTGGTC TGGAGGTCTT TTCCGGCCTG
CAGTGGTACT TCGACCTGGA GTCGCGACCC CTGCAACTGT GGCGCAGGGA GGTGGCGCGC
CTGTCCGGCA GGGTTTTCTT AACTGGCTAT TTGCCTTCGG CCCGAGCCGG TAGCGACTGG
CATCAAGTAC TCCCCTATCA GGGCTATGTA ACTTACGTCT ACCTACCCGA CAGCTGGGCG
GGCCTACCCT GGCAGGAACA GAAATACTTC CTGGACCTGG CCCTGCGGCG GGGGTATACT
GTGGCCAGCC GCCGCGGGGG TCTGGCTTAT TTTTTAATTA ACGGCCAGCC GCCGGGGACT
TCAGTCACCC TGCCGCCGGG TGCTATTTTG GAGATCAAAA TCTACTGGCA GGGGGTAGTA
GAGGGCGATT ACCAGTTTTT GCTTTTCCAG GGCTATAAAA ATATGGGAAA AGCCATCTGG
CAGGCGGAAA CCAGAGGCGC TGGAGGGGGG CGCAGGCCTG CTTGGAAAGT TGAACTGGCG
GCACCGGGAG AAACATCCTA TTACTGGCTC TATGTGTCCG GGCCGGATCA GGTCCTAACC
TCACCGGTTT TTTTAAGACC GGCGAGGCGT TAG
 
Protein sequence
MKIILPEFLR WQPGAEQWAV PVSAWDTGPT SYLRLRRVLL DGRPVAGRNI IFPGVMPVYL 
LPAGARTSDP AAYLATLGRE DKRYACSWLI LKTAGLSALQ VTGGEHSLEL EFITFAGMTC
RAATTILLAP LPAHPHWQPV ELQMHNSSHD DGHWSPAAVV KELVGRGYRA LYFTPHADLI
AGFWEEFAGL CRDLSGTIAV FPGLELATRN SAGHLLIYGL TALEGWQNAR NPGQVIIDRV
NRLPGHTASV TVSHPFGRPP WPWEDEPVVD YSGLEVFSGL QWYFDLESRP LQLWRREVAR
LSGRVFLTGY LPSARAGSDW HQVLPYQGYV TYVYLPDSWA GLPWQEQKYF LDLALRRGYT
VASRRGGLAY FLINGQPPGT SVTLPPGAIL EIKIYWQGVV EGDYQFLLFQ GYKNMGKAIW
QAETRGAGGG RRPAWKVELA APGETSYYWL YVSGPDQVLT SPVFLRPARR