Gene Moth_1109 details


Gene Information

Locus tag: Moth_1109
Symbol:
ID: 3833075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1136120
End bp: 1137217
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 56%
IMG OID: 637829037
Product: 4Fe-4S ferredoxin, iron-sulfur binding
Protein accession: YP_429966
Protein GI: 83589957
COG category: [R] General function prediction only
COG ID: [COG2768] Uncharacterized Fe-S center protein
TIGRFAM ID:
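
The length fields in this record are consistent with the coordinates, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. A minimal sketch of that arithmetic follows; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1136120, 1137217)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```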


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.195082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCCG AAGTTTTTTG GACCAATATG CGTACCAAAA AGGGGAAGAG CCTGGTTGAC 
AAGGTGGCTA CCCTGTGGCG CCGGGCCGGT TTCAAGGACG CCGTCGACCC CGGCGACCTG
GTCGCCATTA AAATTCACTT CGGGGAAAAG GGCAACACGG CCTATATCAA CCCCGTCTTT
GCCCGCCGGG TGGTGGAGTT AATCAAGGCC GCTAAAGGAA AGCCTTTCCT GACGGATTCC
AATACCCTGT ACGTAGGTTC CCGTTCCAAT GCCGTCGATC ACCTGCAGAC GGCCATTGAA
AACGGCTTTG CTTATGCTGT AGCCGGGGCG CCCCTGGTTA TCGCCGATGG CTTGAAGGGA
AAAGATTATT ACGAGGTGCC GGTAAGCGGC AAACATTTTC AAAAGGTAAA AATCAGCAGC
GCCATCTACC ATGCCGATAG TATGGTCGTC ATGAGTCACT TTAAAGGCCA CGAGCTTGTT
GGTTTCGGTG GGACCCTCAA AAACCTGGGC ATGGGTTGTG CCAGCCCTTC CGGCAAGCAG
ATGATGCATA GCGACGTTCT GCCCCGGGTT GAAGAAGAGG AGTGTATTGG TTGCGGCAAG
TGCCGGCGCT GGTGTCCAGC CGGGGCCATT ACCGTCACCG AAAAGGCGAC GATTAATGGC
GAGCTGTGTA TCGGTTGCGG GGAGTGTACC GTCACCTGTC CCCGCAGGGC CATCAAGGCC
AGCTTTAAGA GTGATCCAGT GGTCCTGCAA GAAAAGATCG TCGAGTTTGC TTACGGTTCC
CTTAAGGGCA AGGAGAACAA GTGTGTCTTT TTTAACTTTG TCACCCACGT AGCCCCGGAA
TGCGATTGTA ATTCCTGGGA CGACGCGGCC ATCATCCCTG ATGTTGGTAT CCTGGCTTCC
CTGGACCCGG TAGCCCTGGA TCAGGCCAGT TTTGACCTGG CCAATGCCCA GCCGGCCCTG
CCGGGGACCC GCCTGGACGG TCACGAGGAC GCCAGGGACA AGTTCCAGGC CGTCAGCGGT
TATGACGGGA CACCCCTTTT AAGATATGCC GAGGAAATAG GCATGGGGAC ACGGGAGTAT
AACCTGATAA AGGTTTAA
 
Protein sequence
MAAEVFWTNM RTKKGKSLVD KVATLWRRAG FKDAVDPGDL VAIKIHFGEK GNTAYINPVF 
ARRVVELIKA AKGKPFLTDS NTLYVGSRSN AVDHLQTAIE NGFAYAVAGA PLVIADGLKG
KDYYEVPVSG KHFQKVKISS AIYHADSMVV MSHFKGHELV GFGGTLKNLG MGCASPSGKQ
MMHSDVLPRV EEEECIGCGK CRRWCPAGAI TVTEKATING ELCIGCGECT VTCPRRAIKA
SFKSDPVVLQ EKIVEFAYGS LKGKENKCVF FNFVTHVAPE CDCNSWDDAA IIPDVGILAS
LDPVALDQAS FDLANAQPAL PGTRLDGHED ARDKFQAVSG YDGTPLLRYA EEIGMGTREY
NLIKV
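
The gene and protein sequences in this record can be cross-checked by translating the DNA with the codon assignments of translation table 11. The sketch below is a hedged illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is built by hand from the NCBI amino acid string for table 11, and alternative start codons are not handled (this gene starts with ATG, so that does not matter here). Only the first and last blocks of the gene sequence are used as examples.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G, with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


print(translate("ATGGCTGCCG AAGTTTTTTG GACCAATATG"))  # MAAEVFWTNM
print(translate("AACCTGATAA AGGTTTAA"))               # NLIKV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and the GC content reported in this record.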