Gene Moth_1449 details


Gene Information

Locus tag: Moth_1449
Symbol:
ID: 3831335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1491892
End bp: 1494009
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 56%
IMG OID: 637829382
Product: anaerobic ribonucleoside triphosphate reductase
Protein accession: YP_430302
Protein GI: 83590293
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1328] Oxygen-sensitive ribonucleoside-triphosphate reductase
TIGRFAM ID: [TIGR02487] anaerobic ribonucleoside-triphosphate reductase
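
The start, end, gene length and protein length fields are related by simple arithmetic, and the short Python sketch below checks that they agree with one another. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1491892
END_BP = 1494009


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2118 705
```

Both results match the Gene Length (2118 bp) and Protein Length (705 aa) fields of the record.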


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.437594
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACCGA AGAAAATCAT CAAGCGCGAC GGCCGGGTAG TGGACTTTGA CGAGGAAAGG 
ATTATCAATG CTATTTACAA AGCGGCCCAG GCTGTCGGGG GCCAGGACCG GCGCCAGGCC
AGCCAACTGG CCAACCAGGT GACGGCCCTT CTGGCGGAAA AATTTGTCGA CAGGCTGCCT
ACCGTCGAAG ACGTACAGGA CCTGGTGGAA AAAGTATTGA TTGAGAACGG CCATGCCCGG
ACGGCCAAGG CTTATATCCT TTACCGGCAA CAACATGCGG AATGGCGGGA TTTTCGCCAC
CTCCTGGTCA ACGTGCAGGA AATGGTCCAG GGCTACCTTG ACGGTCAGGA CTGGCGGATT
AATGAGAACA GCAACATGAA TTATTCCCTC CAGGGCCTCA ACAACCATAT CATTGCCGCC
GTCAGTTCCA GGTACTGGCT GGAAAAGGTC TACCCGCCGG CTATCCGCGA GGCCCATGAA
AGGGGCGATA TTCATATCCA CGATCTGGGG TTGCTGGCGC CCTATTGCTG TGGCTGGGAC
CTGGCCGATT TACTGGAAAG CGGCTTTGCC GGCGTGGGGC AGAAGGTGGA AAGCGCGCCG
CCCAGGCATT TCCGGACGGC CCTGGGGCAG ATAGCCAATT TCTTTTATAC CTTGCAGGGA
GAAGCAGCTG GTGCCCAGGC CTTTTCCAGT TTCGATACTT ACCTCGCTCC CTTTATCCGT
TACGACGGCC TGGACTACCG GGAAGTCAAG CAGGCCCTGC AGGAGTTTAT CTTTAATCTC
AATGTGCCCA CCAGGGTGGG TTTCCAGACG CCCTTTGTCA ACCTGACCAT GGATGTGGTC
GTACCGCAGG TTTTAGCCGG CGAGCCGGTG ATCATCGGCG GCGAGCGGCG TAAGGAATGC
TACGGCGACT TCCAGCAAGA GATGGACTGG CTGAATATAG CCTTTTGCGA GGTGATGATG
GAGGGGGACG CCCGGGGCCG CATTTTTACC TTCCCTATAC CCACGTACAA CATCACGCCT
GATTTTCCCT GGGAAAGCCC GGTGGCAGAC AGGATCATGG CCATGACGGC CAAATACGGC
ATCCCCTACT TTGCCAACTT TATCAACTCC GATTTAAAGC CCGAAGACGT ACGCAGCATG
TGCTGCCGCC TGCGCCTGGA TAACCGGGAA CTGAAGAAAC GCGGCGGCGG TCTTTTCGGC
GCCAATCCCC TTACGGGCTC TATCGGCGTA GTAACCATTA ATTTGCCCAG GCTGGGTTAC
GTGAGTTCAT CGAAAGAAGA GTTCTTCACC AGTTTAAAAG CGAAAATGGA CCTGGCCAAA
GAGAGCCTGC TCCTCAAGCG CAACATCCTG GAGAAGCTGA CGGAGCAGGG GCTTTATCCT
TACTCCCGAT TCTACCTGCG GCAGGTTAAG GAGCGTTTCG GTAGGTACTG GGAGAACCAT
TTTAATACCA TTGGTATCGT GGGCATGCAC GAGGCCCTCT TGAACTTCAT GAAGAAAGGT
ATCGAGACCC GCGATGGGAG GGGCCTGGCT TTAGAAATCC TGGACTTTAT GCGTTCTGTC
CTGGCCACCT ACCAGGCGGA AACCGGCCAG CTCTTCAATC TGGAGGCCAC CCCGGCGGAA
GGTACCTCTT ACCGCCTGGC CCGACTGGAC AGGCGTCTTT ATCCTGATAT TATCACCTCC
GGGACAAACG AACCGTATTA CACCAATTCC ACCCATTTGC CGGTCGGCTA CACTGACGAT
GTGTTTACCG CCTTGGAGCA CCAGGACGAG TTGCAACTAA AATATACCGG GGGTACGGTC
TTTCACGCCT ACCTGGGGGA AAGGGTGACG GATACGGTAA CCTGCCGACG TTTTCTGCAG
ACAGTGATGC ATAATTTCCG CCTGCCCTAT TTCACTATTA CGCCTACCTT CAGCATCTGC
CCCGAGCACG GGTATCTTTC CGGCGAGCAC TGGACATGCC CGGCCTGCGG CCGGGAAACA
GAGGTCTGGA GCCGGATCGT CGGCTATTAC CGGCCGGTGA AGAACTGGAA TAAAGGCAAG
CAGGAGGAGT TCCGGGAACG GCGGGGCTTC CGTCCTGTCT CCTGCGGCGA CGAACAAACA
CGCCAGGCGG CAGTTTAG
 
Protein sequence
MIPKKIIKRD GRVVDFDEER IINAIYKAAQ AVGGQDRRQA SQLANQVTAL LAEKFVDRLP 
TVEDVQDLVE KVLIENGHAR TAKAYILYRQ QHAEWRDFRH LLVNVQEMVQ GYLDGQDWRI
NENSNMNYSL QGLNNHIIAA VSSRYWLEKV YPPAIREAHE RGDIHIHDLG LLAPYCCGWD
LADLLESGFA GVGQKVESAP PRHFRTALGQ IANFFYTLQG EAAGAQAFSS FDTYLAPFIR
YDGLDYREVK QALQEFIFNL NVPTRVGFQT PFVNLTMDVV VPQVLAGEPV IIGGERRKEC
YGDFQQEMDW LNIAFCEVMM EGDARGRIFT FPIPTYNITP DFPWESPVAD RIMAMTAKYG
IPYFANFINS DLKPEDVRSM CCRLRLDNRE LKKRGGGLFG ANPLTGSIGV VTINLPRLGY
VSSSKEEFFT SLKAKMDLAK ESLLLKRNIL EKLTEQGLYP YSRFYLRQVK ERFGRYWENH
FNTIGIVGMH EALLNFMKKG IETRDGRGLA LEILDFMRSV LATYQAETGQ LFNLEATPAE
GTSYRLARLD RRLYPDIITS GTNEPYYTNS THLPVGYTDD VFTALEHQDE LQLKYTGGTV
FHAYLGERVT DTVTCRRFLQ TVMHNFRLPY FTITPTFSIC PEHGYLSGEH WTCPACGRET
EVWSRIVGYY RPVKNWNKGK QEEFRERRGF RPVSCGDEQT RQAAV
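
The protein sequence can be derived from the gene sequence by codon-by-codon translation. The Python sketch below shows one way to do this with the standard library alone. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons they permit, which does not matter here because the gene begins with ATG). The names `CODON_TABLE`, `clean` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 18 bases of the gene sequence above.
print(translate("ATGATACCGA AGAAAATCAT CAAGCGCGAC"))  # MIPKKIIKRD
print(translate("CGCCAGGCGG CAGTTTAG"))               # RQAAV
```

The two outputs match the first ten and last five residues of the protein sequence listed above. Passing the whole gene sequence to `translate` should yield the full protein under the same assumptions.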