Gene Moth_1323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1323 
Symbol 
ID3831033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1368599 
End bp1369930 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content59% 
IMG OID637829259 
Producthypothetical protein 
Protein accessionYP_430179 
Protein GI83590170 
COG category[C] Energy production and conversion 
COG ID[COG1625] Fe-S oxidoreductase, related to NifB/MoaA family 
TIGRFAM ID[TIGR03279] putative FeS-containing Cyanobacterial-specific oxidoreductase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.106789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCACTA AGAGGGGCCG GATAATTGCC GTCCGGCCGG ATAGTATTGC CGCTGAGCTA 
GGAATTAACC CGGGCGATGA GGTGGTAGCC ATCAATGGAG AACCTGTGCC CGACCTCATT
GCCTACCGTT ACCTCTGTGC CGATGAAAAC CTCCAGGTTG AAATAAAAAA GGCTGATGGC
GAGACCTGGC TGCTGGATAT TGAAAAGGAT TACGGCGAAG ACCTGGGACT GGAGTTTTCC
GGTCCAACAT TTGACGGCCT ACGCCACTGC GCCAACAAAT GCCTCTTCTG CTTCGTCGAT
CAAATGCCCG CCGGCCTGCG GCCGGGCCTT TATATCAAAG ACGACGATTA CCGCTATTCC
TTCTTACACG GTAATTTTAT CACCCTTACC AACCTGAAAC CAGGGGACTG GGATTATATC
CTGCGCTGGC ACCTCAGCCC CCTCTATATA TCCGTCCATA CCACCAATCC GGAACTGCGG
CGGCATATCC TGGGCAACCC TCGGGCTGGA GCCATCATGG ACCAGCTTGG TCGCCTGGCC
GCAGGCGGTA TCCAGATGCA TACCCAGATT GTCCTCTGCC CGGGGCTGAA CGACGGCCCG
GAGCTGGAGC GCACGGTCAA GGACTTAAGC CGGCTTTTCC CGGCGGTACA GTCCATCGCC
GTGGTTCCGG TGGGCCTGAC TGCAGAGCGA GAAGGGCTAT TTCCTTTGAG GCGGGTAACC
CCGGGCGAGG CCAGGGAAAT AGTGACCCGA ATAGAGGAAT GGCAGTCCAG CTTCCGGCAA
AGCTTTGGCC GGGGCCTGGT CTACGGGGCC GATGAACTCT ACCTCCTGGC AGGGATACCC
CTGCCTGCGG CGGCTTATTA TGACGATTTT CCCCAGACAG AGAACGGCAT CGGTATCACC
CGCCTCTTTC TGGATGAGTA TGAAGTCGCG GTCAAGAAAA TCCCGCGGGC CCTGACCGGG
CCGCGCCGGG TAGTCGTCGC CACCGGGGTC CTGATAGCTC CTCTCCTGAC CAGGCTGGTT
CAACGGCTGG TAGCGGGGGT CACCAACCTG GAGGCCAGGG TGGTTGCGGT ACCCAATCGT
TTCTTCGGGC CAAAGGTGAC TGTAGCCGGG CTCCTCACCG GCCAGGATCT ACTGGCCGAA
CTGGGGGAGG CCGCCTCCTG GGCCCGGGAA AAGAAGGGCC TGGTTATCCT ACCGGACGTT
ATGTTGAAAA GCGATGCACC GGTTTTCCTG GACGACCGGA CGCCAGCAAT GCTTGCCAGG
GAATTAGGAG TACGGGTAGA GATTATCCCG GCTACAGGGG AAGGACTGGT AGCGGGGATA
TTAGAGGTAT AG
 
Protein sequence
MPTKRGRIIA VRPDSIAAEL GINPGDEVVA INGEPVPDLI AYRYLCADEN LQVEIKKADG 
ETWLLDIEKD YGEDLGLEFS GPTFDGLRHC ANKCLFCFVD QMPAGLRPGL YIKDDDYRYS
FLHGNFITLT NLKPGDWDYI LRWHLSPLYI SVHTTNPELR RHILGNPRAG AIMDQLGRLA
AGGIQMHTQI VLCPGLNDGP ELERTVKDLS RLFPAVQSIA VVPVGLTAER EGLFPLRRVT
PGEAREIVTR IEEWQSSFRQ SFGRGLVYGA DELYLLAGIP LPAAAYYDDF PQTENGIGIT
RLFLDEYEVA VKKIPRALTG PRRVVVATGV LIAPLLTRLV QRLVAGVTNL EARVVAVPNR
FFGPKVTVAG LLTGQDLLAE LGEAASWARE KKGLVILPDV MLKSDAPVFL DDRTPAMLAR
ELGVRVEIIP ATGEGLVAGI LEV