Gene Moth_1179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1179 
Symbol 
ID3832982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1209566 
End bp1211206 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content49% 
IMG OID637829112 
Productferredoxin-dependent glutamate synthase 
Protein accessionYP_430036 
Protein GI83590027 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000368179 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCAAT GGCCAAAACA AAATGATGTC CTGGGAACCG TTAATCGCGG CAATCCGGCA 
GAATCTGGTC TATGTACCTT ATGCCGGGCA GATTGCCAGG GGAAATGCGA GACCTGGATG
TCCAGCATGG TGGGGCGCAA GCTCCTCTAT CCCCGGGATT TCGGTACTGT TACAGCCGGA
AGCTCAAACA CAACCCATGT AGGAATTTCT TATAACTCCT TGAGGATCCT GGGATACAAC
TACGGCGCTT ATGGTCTAAC AAAAGGCCTC TCCAATAGTG CTGATGACTG TATTTTCCCC
AATGTAAATG TCGAAACCGA ATTTGGTAGT GAGGTAAAAA CCAAAGTCAA AGCCCCGATC
ATGACGGGAG CCTTAGGATC TACCTTTATC GCCGCGAAGT ACTGGGATTC CTTTTCTATA
GGTGCTGCCC TGGTTGGAAT CCCGATCGTT ATTGGGGAAA ATGTCGTAGG GATTGACAAG
GAAGCCGTTA TTGAAAATGG TAGGATTGTC AAAGCCCCGG AATTGGATCG ACGTATTCAG
ACCTACTTGA AATACTATGA TGGATTTGGT GCCATTATCG TCCAGATGAA CGTAGAAGAC
ACCCGCAACG GTGTGGCCGA ATACGTTATC GAGAAGTACG GCGATCAGGT TATTCTGGAG
CTGAAGTGGG GTCAGGGCGC CAAGGACATC GGCGGAGAAA TTCAGGTTAC CGATCTCGAG
TATGCCATCT TCTTGAAGAA CAGGGGCTAT GTGGTCGATC CTGATCCAAC CATTCCGGAA
GTTCAGGAAG CCTTTAAGAG CGGGGCTATT AGATCCTTTG CCCGCCACAG CCGCTTGGGC
TATACGAATC TCACCAGTTT TGAGCAGGTG AGAGAAAACT TTATGACAGC CATTGAGTAC
CTCCGCGGAC TGGGCTACAA GCGGATTACC TTGAAGACCG GCTCCTATGG AATGGAAGCC
CTGGCTATGG CTATCAAGCT CGCTTCTGAT GCGAAACTTG ATTTGCTTAC CGTGGACGGC
TCAGGCGGCG GTACCGGCAT GAGCCCGTGG AACATGATGG AAACCTGGGG CGTTCCCTCT
ATTCTGCTTC ATTCTAAAGC CTATGAATAT GCCAGCCTGT TAGCTGCCAG GGGTAAGAAA
GTTGTCGATA TGGCCTTTGC CGGCGGATTG GCCAGGGAAG ATCATATTTT CAAAGCTTTA
GCTTTGGGTG CGCCTTACGT TAAACTGGTA TGTATGGGGC GAGCCTTAAT GATTCCGGGC
TTTGTCGGGT CCAACATTGA AGGTGTGCTC CATCCTGAAA GACGCGCGAA AGTCAACGGA
AACTGGGATA GTTTACCCAG GACTGTGGCC GAATTAGGTA CGAAAGCTGA AGAGATTTTT
GCCGGTTACT ATGATGTTCA AAAGAAAGTC GGCGCTGAGG AGATGAAGAA TATTCCTTAC
GGTGCCATTG CCTTCTGGAC ACTAGCCGAC AAACTGACGG CAGGATTACA GCAATTAATG
GCTGGAGCCC GCAAATTCTC CCTGGATCAA ATCACCAGAA ATGATATCGC TTCCGCTAAC
CGGGAAACCG AAGCCGAAAC AGGTATACCC TTTATTACCG ACGTTCAGGA CGAATTAGCC
CGGAAGATTC TCCTTAGTTA A
 
Protein sequence
MIQWPKQNDV LGTVNRGNPA ESGLCTLCRA DCQGKCETWM SSMVGRKLLY PRDFGTVTAG 
SSNTTHVGIS YNSLRILGYN YGAYGLTKGL SNSADDCIFP NVNVETEFGS EVKTKVKAPI
MTGALGSTFI AAKYWDSFSI GAALVGIPIV IGENVVGIDK EAVIENGRIV KAPELDRRIQ
TYLKYYDGFG AIIVQMNVED TRNGVAEYVI EKYGDQVILE LKWGQGAKDI GGEIQVTDLE
YAIFLKNRGY VVDPDPTIPE VQEAFKSGAI RSFARHSRLG YTNLTSFEQV RENFMTAIEY
LRGLGYKRIT LKTGSYGMEA LAMAIKLASD AKLDLLTVDG SGGGTGMSPW NMMETWGVPS
ILLHSKAYEY ASLLAARGKK VVDMAFAGGL AREDHIFKAL ALGAPYVKLV CMGRALMIPG
FVGSNIEGVL HPERRAKVNG NWDSLPRTVA ELGTKAEEIF AGYYDVQKKV GAEEMKNIPY
GAIAFWTLAD KLTAGLQQLM AGARKFSLDQ ITRNDIASAN RETEAETGIP FITDVQDELA
RKILLS