Gene Moth_0259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0259 
Symbol 
ID3833222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp266099 
End bp267439 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content63% 
IMG OID637828195 
Producthypothetical protein 
Protein accessionYP_429137 
Protein GI83589128 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATGGCT TAAAATGGCT CTACCCGGGG CTTAAAATTA AACGTTGGCT GTTGCTGGCA 
GTCCTGGGTT TGCTGCTGCT TGTTTCTGGT CTAACGGTTA TCTTGGGGAT AACCCTGCTG
GCTTCGGCGG AAAAAGGAGT TACCTGGTTT ATCCTTCATA CCCTGGGTGG CCTGGGCTCG
CCCCTGCTGG CTGGTCTTTT GGCCATGGCC CTGGGAGCGG TCTTTATTGG GGTGGCCGTC
CGGAATCTGG CCCGTTCGGT TATCCAGGTT CTCTTGCCCG GTCATACCGC CAATCCCTGG
CAGGTTTTTT ACCGGCGCCA GTACCTGGCC CGGGGCCCCC ACCTGGTGGC CATCGGCGGG
GGGACGGGGC TGGCCGTCCT CTTGCGGGGT TTAAAAAACT ATACCCGCAA CCTGACGGCC
ATCGTCACCG TGGCCGATGA CGGGGGAAGT TCCGGTCGCC TGCGCCAGGA ATTGAGCATC
CCGCCCCCGG GGGATATCCG CAATTGCCTG GTGGCCCTGG CCGATACGGA AAGCCTCATG
GAGGATCTCT TCAGCTACCG GTTCCGCCAG GGCGAGGGCC TGGCCGGTCA CAGCCTGGGG
AATCTCCTCC TGGCGGCCAT GACGGATATG GCCGGTGATT TTGACCGGGC CATCCAGGAA
CTGGCCCGGG TCCTGGCGGT AGGGGGGCGG GTCATCCCCT CGACGACCAC CCATGTCGTC
ATGGGTGCCG AACTGGCCGA TGGCAGCACC GTCCTGGGTG AAAGCAATAT CCCCCTGGCC
GGCAAACCCA TTAAAAGGGT GTTTTTAAAA CCGGCTGACT GCCGGCCGCC GGCGGCGGCC
CTGGAAGCCA TTGCCCGGGC CGACGCCGTG ATAATCGGCC CGGGGAGTCT GTATACCAGC
GTCCTGCCAA ACCTGCTGGT GCCGGGTATT GTCGAGGCCC TGCGGGATAC CCCGGCACCG
GTCTTTTATG TTTGCAACAT CATGACCCAG CCGGGAGAAA CGGACGGTTA CACGGTGGCC
GACCACCTGC GGGCCCTCAT CGACCACTGC GGCCAGGGGA TAATAGATAC GGTAATCGCC
CACAGCGGCC CCATTTCCCG GGCCGCCCGG CGGCGTTACG GCGAAAAGGG AGCCCGGCCG
GTCCTGATTA ACAGCCCGGC AATCGCCAGG ATGGGGGTAG AGCTGCGCCG CGGCTGGCTG
GTAGACGAGA CCCATGTCGT CCGCCATCAC CCCGAACGAT TGGCCAGCCT GGTCATGGAA
GAGGTTTACC GGCACCAGGC CCGCGGCCGG CGGCGTTTTT TTTACCTGGT ACGGGAGAGA
TTTCGCACCC TGGCCCGGTA G
 
Protein sequence
MDGLKWLYPG LKIKRWLLLA VLGLLLLVSG LTVILGITLL ASAEKGVTWF ILHTLGGLGS 
PLLAGLLAMA LGAVFIGVAV RNLARSVIQV LLPGHTANPW QVFYRRQYLA RGPHLVAIGG
GTGLAVLLRG LKNYTRNLTA IVTVADDGGS SGRLRQELSI PPPGDIRNCL VALADTESLM
EDLFSYRFRQ GEGLAGHSLG NLLLAAMTDM AGDFDRAIQE LARVLAVGGR VIPSTTTHVV
MGAELADGST VLGESNIPLA GKPIKRVFLK PADCRPPAAA LEAIARADAV IIGPGSLYTS
VLPNLLVPGI VEALRDTPAP VFYVCNIMTQ PGETDGYTVA DHLRALIDHC GQGIIDTVIA
HSGPISRAAR RRYGEKGARP VLINSPAIAR MGVELRRGWL VDETHVVRHH PERLASLVME
EVYRHQARGR RRFFYLVRER FRTLAR