Gene Moth_2018 details

Gene Information

Locus tag: Moth_2018
Symbol:
ID: 3831972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2104686
End bp: 2106104
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 59%
IMG OID: 637829947
Product: ferredoxin-dependent glutamate synthase
Protein accession: YP_430857
Protein GI: 83590848
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0069] Glutamate synthase domain 2
TIGRFAM ID:
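
The start, end, gene length, and protein length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2104686
END_BP = 2106104


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                  # 1419 bp
    print(protein_length(length), "aa")  # 472 aa
```

Both results match the Gene Length and Protein Length values listed in the record.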


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAATC GAAAAACCGG GAATACCAAC CTCTGGAAAG TAGCGCTGGG TTCGGCCGGC 
CTGACCTGTT TGGGGCTATG GCTTTTCAGC CGCCCCCTGC TGAACCGCAT TCACGATTCT
TTCCTTAAAA CCGTGATGAC CGATCCTTAT GAAGAGAATT TCTGGGAATT TGTCTCGGCC
GCAAACCGTA CCGGTCTCCA GAAAATTGTC GAGACCAATT TACGGGCTCA GCAGGGGAAA
CTCATCCAGA GGCCCTTTGG CAGCCCCCGG CGTTTCCCCG GTACGGACGG CCTTGTTTTT
AACCTGGCGC AGCTGGCCAG GCTTCCTGTT GAAGAGGGTG TTCCCGTTGA TACAAAGGTA
ACCTTGGGGC CCCGGGCGGC CAAGCCCCTT AATATTAGCA TGCCGATCAT CATCTCCGGC
ATGGCCTACG GGCTGGCCTT AAGCGAAAAG ACCAAGATAG CCCTGGCGAG AGGAGCCAGC
CTGGCCGGTA CCGCCACTAA TACCGGTGAA GGGCCTTTCT TGCCTTCCGA GAGGCAGGCT
GCCAGGCACC TTATCGTCCA GTACAACCGC GGAGGCTGGA ACCACAACCC CCGTATACTC
AAACAGGCCG ACATGGTAGA AATCCAGTTC GGCCAGGCAG CCATAGGTGG CCTGGGCCAC
AGCACCAATT ACGGCGAGAT ACCTACCAAA GGTCGTCGCC TCCTGGGGAT CAAGCCCGGC
CAGGCGGCTG TCACCCATGC CCGTATGCCC GGTATAAAAG ACCCTAAAAA AGACCTGCCC
CCCCTCGTCA CCAGGTTACG CCACCTGACC GGGGGCGTTC CCATTGGCGC TAAAATCGGC
GCCGGCAATG ACCTGGAGAA GGACCTGGCC ATCCTCCTGG AAGCAGGCGT CGACTTTATT
GCCATCGATG GAGCCGGGGC GGCATCCAAG GGCTCACCGC CCATCGTCCA GGATGACTTT
GGCGTGCCCA CGGTCTATGC CGTCAACCGT GCGGCTACTT TTCTAAAAAA ACAGGGGGTA
AAGGACAGAG TGAGCCTGAT AGCCGGCGGC GGGCTGGTTA CCCCGGGCGA CTTTTTAAAA
ATCCTGGCCT TGGGTGCCGA CGCCGTTTAC ATCGGTACCA TAGCCCTCTT CGCCCTCACC
CACACCCAGG TCTTAAAGGC CATGCCCTGG GAACCCCCGG TCCAGGTTGT CTTTGCCCAG
GGCCGCTACC AGGATCAGCT GGATGAAGAC AGGGCGGCCC ACAATCTCGC CAATTTCTTA
TGGTCCTGCA ACGCCGAAAT CATGGAAGGC GTACGCGCCC TGGGCAAGAA ATCCGTCAAA
CAGGTCGACA AGTCCGACCT GGCCGCCCTG GACCCCGTTA CCGCCAGGGC TTTAGGCATC
CCCCTGGCGG CCCGGGCCAG GAGTTGCTTT CCCTCCTGA
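
The GC content field of this record can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as a string in the layout used here (10-base blocks separated by spaces and line breaks); `gc_content` is a hypothetical helper name, and the sample string is only the first 60 bases of the gene, so its value will differ from the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return gc / len(cleaned)


if __name__ == "__main__":
    # First line of the gene sequence only (illustrative fragment, not the full gene).
    first_line = (
        "ATGGGAAATC GAAAAACCGG GAATACCAAC CTCTGGAAAG TAGCGCTGGG TTCGGCCGGC"
    )
    print(f"GC fraction of the first 60 bases: {gc_content(first_line):.2f}")
```

To compare against the record, pass the complete gene sequence to `gc_content` and round the result to a whole percentage.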
 
Protein sequence
MGNRKTGNTN LWKVALGSAG LTCLGLWLFS RPLLNRIHDS FLKTVMTDPY EENFWEFVSA 
ANRTGLQKIV ETNLRAQQGK LIQRPFGSPR RFPGTDGLVF NLAQLARLPV EEGVPVDTKV
TLGPRAAKPL NISMPIIISG MAYGLALSEK TKIALARGAS LAGTATNTGE GPFLPSERQA
ARHLIVQYNR GGWNHNPRIL KQADMVEIQF GQAAIGGLGH STNYGEIPTK GRRLLGIKPG
QAAVTHARMP GIKDPKKDLP PLVTRLRHLT GGVPIGAKIG AGNDLEKDLA ILLEAGVDFI
AIDGAGAASK GSPPIVQDDF GVPTVYAVNR AATFLKKQGV KDRVSLIAGG GLVTPGDFLK
ILALGADAVY IGTIALFALT HTQVLKAMPW EPPVQVVFAQ GRYQDQLDED RAAHNLANFL
WSCNAEIMEG VRALGKKSVK QVDKSDLAAL DPVTARALGI PLAARARSCF PS