Gene Moth_0491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0491 
Symbol 
ID3832814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp504556 
End bp505833 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content62% 
IMG OID637828425 
Producthypothetical protein 
Protein accessionYP_429364 
Protein GI83589355 
COG category[L] Replication, recombination and repair 
COG ID[COG1697] DNA topoisomerase VI, subunit A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGATCA GTAAATACGA AGGGAGCCGC TCTTTTCAAA CCGGCACCCC GGGCAAGCAG 
CGGCCCCAGT TCGCCATGAA GAAGAGCCCC CTGGCCGGCG ATTATTTTGA TGAGATGGAC
CACCGCAAAA GGGAGGCCAT CCACGCCGCC CTGGCCGAAC TGGCGGCTGC CGGGGTGGTG
GAGGTCACCT GGCCCCGCTT CCAGGAAGGC CGCCAGGTGG AAAAGGTGTA TTTGAACTTT
GACGCCATCC CCCGGGCCTA TGAGCTGGCG GGGCTGGTGC CCAGGGCGGA ACGGATCTGC
AGGCTGCGCC AGGTCCTGGC TCCCCTGGCT ACCCACCCCT GGGAGTGGGT GCGGCGATGG
TGGGCCGGGG TGGACGCATC TTTAGGAGAA CGGCGGTCCG CAGGCCTGGA CCTGGAGGAC
CCGGAAGGCT ACGGGGAACT GGTCAAGGTG CTCCTGGCCC TGCCGGGATT GGAAGACAGC
ACACCGGAGC GCATCTTCAG CCAGCGGGTC CTGGGGGATT CCAAGGCCTT CGAGCAAAGG
GTGAAAAAGC GGTTGCTGGC CCTGCTCAAG TCTTACGGTC CGGAGGAATA TGAAACCGAC
GCCGAATATC TGGACAGCGT CGGCCTGACC GATAATCCCA AAATGGTCCT GGTGGCCGGA
CCCATGACTT TCCGGGTGGG AAGGACCACC GTCAATGTGG GGGGACTTCC GGGCGGCCTG
GGTCTGGCCG CTCATATGGT GCGGGCCATG GAGATAACGG CCGTTACCGC TCCTTGGGTT
CTCCTGGTGG AGAATTTGAC CAGTTACTAT CAGGTTGTCC AAAGTGTAAG TGAGCTGGCT
GTACCTTGCC GGGAAGGGGG AGGGGGCCGG GGTTTAGCGG TTGAGGGCCC TGCTGGCTTA
GTAGTATATA CCGGCGGCTT CCCCCACCGC GGCGTGCAGC TTTTTTTACG CCGGCTCCAG
GATTACCGGG AATCCCCTGG AGCCACCGCC AGGCCCCCGG TCTACCACTG GGGCGATATG
GACTACGGCG GCATCCGCAT TTGCGAATAT ATCCGCCGCA ACTTAATTCC CGATTTGCAG
CCCTACCTTA TGGATGTCAC TACTTATACC AGGTATTTGC CGGCCGGAAT ACCCTTCGGC
GACGAGTATG CCGCCAGGCT CCGGCACTTG GCTGAGGACC CGGCTTACGC CCCCTGGCAC
CCCCTCCTGC AGGCCATGCT GAAGCACCGC AAATGGGTGG AGCAGGAGAG CATTGCGATC
AATGTAAGCT GGGCGTAA
 
Protein sequence
MLISKYEGSR SFQTGTPGKQ RPQFAMKKSP LAGDYFDEMD HRKREAIHAA LAELAAAGVV 
EVTWPRFQEG RQVEKVYLNF DAIPRAYELA GLVPRAERIC RLRQVLAPLA THPWEWVRRW
WAGVDASLGE RRSAGLDLED PEGYGELVKV LLALPGLEDS TPERIFSQRV LGDSKAFEQR
VKKRLLALLK SYGPEEYETD AEYLDSVGLT DNPKMVLVAG PMTFRVGRTT VNVGGLPGGL
GLAAHMVRAM EITAVTAPWV LLVENLTSYY QVVQSVSELA VPCREGGGGR GLAVEGPAGL
VVYTGGFPHR GVQLFLRRLQ DYRESPGATA RPPVYHWGDM DYGGIRICEY IRRNLIPDLQ
PYLMDVTTYT RYLPAGIPFG DEYAARLRHL AEDPAYAPWH PLLQAMLKHR KWVEQESIAI
NVSWA