Gene Moth_1454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1454 
Symbol 
ID3831340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1499828 
End bp1501063 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content56% 
IMG OID637829387 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_430307 
Protein GI83590298 
COG category[C] Energy production and conversion 
COG ID[COG1251] NAD(P)H-nitrite reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.534279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCA CAGACATTCT GGTTGCCGGT GGAGGTATTG CCGGGTGCAC CGCGGCGCTG 
GCTGCCAGGC GATACTACAG CGATAAAAAA ATCACCCTGG TACGCCGGGA AGTCAGGGCG
CTGATGCCCT GGGGCCTTGC TTACGCCTGT GGTGCCGGGT CATTAAATGA GTATATCCTG
GGCGACTCCC GGCTTTATAA AGAGGGAATT GAACTGGTAA TCGATGAAGT GACGGCCATT
GATCCCGGGG GTAAACGGGT TACCACTGCC TTTGGTGAAA AAATAGCTTA CGATAAACTG
ATCCTCGCCA TTGGTTCTTC GCCGGTCACT TCTTTACTTC AAGGAACGGA ACTCCCGGGC
GTTTTTGTTT TGAAAAAAGA GCTTCCCTAC CTTAAAAGCC TTAAAGAGCA CCTGGCCAGG
GCCAGGAACG TGGTTATCGT TGGTGGCGGG CTAAACGGCG TAGAACTGGC AGCGGCCTGC
AGCGCCAACC ACCAGCTTCA CATCACCCTG GTAGAACAAC TACCCCATTG CCTGTCCGGG
GTCTTTAATG ACGATACTTG CATTTTAATA GAAGAAAAAC TGCGCCGGAA GGGTGTTGCC
ATTATAACCG GAGCGGCAGC GGAAGGACTG GAAGGCTGCC ATCGAGTAGA GGGTGTCAGG
CTAACGGGTG GACGGACTTT ACCTGCTGAT GTGGTGGTCC TGGCCACCGG TATCGTACCC
AATACCCTCC TGGCCCGGCA GGCCGGTCTG GCAACCGACG AAAATGCCGG CATCCTGGTG
GATGAGTATA TGCAGACCAG TGCCACCGAT GTCTTTGCCA TCGGCGACTG CGCCGCTCAA
AAATCCCTTG TCCCTACCGG CGGTTCTCTT ACCAGGCAGG CTGGACCGGC CGGCCACGAG
GCCCGCGTAG CCGCCGCCAA CCTCTTCGGC CTGAAGCGAG CCAGGGAAAT TACTGTTAAG
AAGATCTCTG TAGCTATCGG GGACCTGGTC TTTGGCTCCG TGGGCCTCAT AAAAATTTCC
CTTGCGGAGA CCGGGACCGG AATGCCGACC ACCGCCCTTG CTCACGATGT GATCGCCAAA
GATCTGGCGG TCAAGGTAGT TTATGTCCGG GAAACCGGGG CCACCCTGGG CGCTGAGGTC
TACGGTAAAC CCCTCATCCG GGTGCGGGAA ACCATGAACA ATCTCGCGTC TGCTATTGAA
CGGCAAACAC CCTTTGCCGG TCTGGCCCTG GCCTAA
 
Protein sequence
MKRTDILVAG GGIAGCTAAL AARRYYSDKK ITLVRREVRA LMPWGLAYAC GAGSLNEYIL 
GDSRLYKEGI ELVIDEVTAI DPGGKRVTTA FGEKIAYDKL ILAIGSSPVT SLLQGTELPG
VFVLKKELPY LKSLKEHLAR ARNVVIVGGG LNGVELAAAC SANHQLHITL VEQLPHCLSG
VFNDDTCILI EEKLRRKGVA IITGAAAEGL EGCHRVEGVR LTGGRTLPAD VVVLATGIVP
NTLLARQAGL ATDENAGILV DEYMQTSATD VFAIGDCAAQ KSLVPTGGSL TRQAGPAGHE
ARVAAANLFG LKRAREITVK KISVAIGDLV FGSVGLIKIS LAETGTGMPT TALAHDVIAK
DLAVKVVYVR ETGATLGAEV YGKPLIRVRE TMNNLASAIE RQTPFAGLAL A