Gene Moth_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0007 
Symbol 
ID3831879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp4828 
End bp6732 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content53% 
IMG OID637827934 
ProductDNA gyrase, B subunit 
Protein accessionYP_428890 
Protein GI83588881 
COG category[L] Replication, recombination and repair 
COG ID[COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit 
TIGRFAM ID[TIGR01059] DNA gyrase, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000328066 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000130354 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGATCAGG TTCGTAATGA ATATGATGCT AGCCAAATCC AGATTCTAGA AGGTCTGGAA 
GCTGTGCGCC GGCGACCGGG CATGTACATC GGCAATACCG GCGTGCGGGG CCTGCACCAG
CTGGTCTTTG AATTGGTCGA CAACAGCATC GACGAGGCCC TGGCCGGCTT TTGCGACCGG
ATTGAGGTTA CGATTCACGA GAACGGCAGC CTGACAGTCG CTGATAACGG CCGGGGTATT
CCGGTGGATA TCCATGAAAA GACCGGCCTG CCGGCGGTGG AGGTAGCCCT GACCATCCTC
CACGCCGGGG GAAAATTTGG CGGGAACGGT TACAAAGTAG CCGGGGGTCT CCATGGTGTC
GGCCTTTCAG TGGTCAATGC CCTGTCGGAG TGGCTGGAGA TAAAGGTTAA ACGTAATGGT
AAGATTTATC ATCAGGAGTA CCGGCGCGGC CAGAAGGTGT CCGAGTTAAA AGTTATTGGT
AAAACCAAAG GTACGGGCAC CAGCGTAACT TTTTATCCTG ACGGTGAGAT TTTTGAAGAC
CTGGTTTTCC AGGACGAGAT AATCGGCCGC CGGTTACAGG AGCTTTCCTT CCTGAACCGG
GGCGTGAAGA TAGTCTTCCG CGACGAAAGG AACGAAAGCG AGACCACCTA TTACCACACC
GGTGGTTTAA TTGACTTTGT CCGCCATTTA AATAAAAACA AAACCGTCCT CTTCAACAAG
CCCCTTTACT TCAGCGGTGA AAAGGATGAC GTCCAGGCAG AGATAGCCAT TCAGTATAAT
GACGGCTATA ATGAACTTAT TCTGTCCTAT GCCAATAACA TCCATACTGT TGAGGGTGGC
AGCCACGAGA TTGGTTTTAA AACCGCTTTA ACCAGGGTGA TCAACGATTA TGCCCGCCGC
TTTAATCTTT TGAAGGACGC TGAGGCCAAC CTCTCCGGCG AGGACATCCG GGAGGGCCTG
ACGGCCGTCA TTAGCGTCAA GGTCCTCGAG CCCCAGTTTG AAGGCCAGAC CAAGACCAAA
CTGGGTAACA CGGAGGTACG AGGGATTGTC GACAGCCTGG TAGCCGAAAA CTTGAGCGCT
TACCTGGAGG AAAATCCCAC CATTGGCCGG AGGATTGTTG ACAAAGCCCT CAACGCCTTT
AGGGCCCGTG AAGCAGCTCG TAAGGCCCGG GAACTTACCC GGCGTAAAAA CGCCCTCGAG
ATAACCTCCC TGCCGGGTAA ACTGGCCGAC TGTACCCATA AGGACCCGGC TATGGCCGAA
CTCTTTCTGG TAGAAGGCGA TTCCGCCGGC GGTTCAGCCA AACAGGGCCG GGATCGCCGT
TTCCAGGCCA TCCTGCCCCT GCGGGGCAAG ATCTTGAATG TCGAGAAGGC CAGGCTGGAT
AAAATCCTCA ACAACGAAGA GATCCGGACC ATCATCACCG CCCTGGGGAC GGGCATAGGC
GATGACTTTA ATATCAATAA GGCCCGTTAC CACAAAACCA TCCTCATGGC CGATGCCGAT
GTCGACGGTT CCCACATTCG TACCCTTTTA TTGACCTTTT TTTACCGCTA TATGCGGCCG
CTAATTACCG AAGGTTATAT CTACATCGCC CAGCCGCCTC TTTATAAAGT CTACCGGGGC
AAGGTTGAGC GTTACCTCTA CAACGATGCC GAATTAGAGA AGTTCCTCAA AGAACATGAA
GGCGAGCGCT GGGAGATCCA GCGTTACAAA GGCCTGGGTG AAATGAACCC TGAACAACTC
TGGGAAACCA CCATGAACCC CGAGTCGCGG ACCCTCCTTC AGGTTAACCT TGAAGACGCC
ATGGAGGCCG ACGCCATCTT TAACATCCTC ATGGGCGACC GGGTGGAACC GCGACGGGAA
TTTATCCAGC AGCATGCCCA CGAAGTCCGC AACCTGGACA TTTAA
 
Protein sequence
MDQVRNEYDA SQIQILEGLE AVRRRPGMYI GNTGVRGLHQ LVFELVDNSI DEALAGFCDR 
IEVTIHENGS LTVADNGRGI PVDIHEKTGL PAVEVALTIL HAGGKFGGNG YKVAGGLHGV
GLSVVNALSE WLEIKVKRNG KIYHQEYRRG QKVSELKVIG KTKGTGTSVT FYPDGEIFED
LVFQDEIIGR RLQELSFLNR GVKIVFRDER NESETTYYHT GGLIDFVRHL NKNKTVLFNK
PLYFSGEKDD VQAEIAIQYN DGYNELILSY ANNIHTVEGG SHEIGFKTAL TRVINDYARR
FNLLKDAEAN LSGEDIREGL TAVISVKVLE PQFEGQTKTK LGNTEVRGIV DSLVAENLSA
YLEENPTIGR RIVDKALNAF RAREAARKAR ELTRRKNALE ITSLPGKLAD CTHKDPAMAE
LFLVEGDSAG GSAKQGRDRR FQAILPLRGK ILNVEKARLD KILNNEEIRT IITALGTGIG
DDFNINKARY HKTILMADAD VDGSHIRTLL LTFFYRYMRP LITEGYIYIA QPPLYKVYRG
KVERYLYNDA ELEKFLKEHE GERWEIQRYK GLGEMNPEQL WETTMNPESR TLLQVNLEDA
MEADAIFNIL MGDRVEPRRE FIQQHAHEVR NLDI