Gene Moth_1294 details


Gene Information

Locus tag: Moth_1294
Symbol:
ID: 3831557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1334225
End bp: 1335559
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 59%
IMG OID: 637829231
Product: L-glutamine synthetase
Protein accession: YP_430151
Protein GI: 83590142
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID: [TIGR00653] glutamine synthetase, type I
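
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1334225, 1335559)
print(length, protein_length_aa(length))  # 1335 444
```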


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0180309
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGACA AAGAAAAGAG AGCAGCGGTC CTGCAACAGG CCGAGGAATG GGGGGTCAAG 
TTTGTCCGCC TCCAGTTTAC GGACATCTTT GGCGTACTCA AAAACGTAGC CATTCCCGTG
GACCAGCTCC CCAAGGCCCT CAATAACGAG CTGATGTTCG ACGGATCTTC AATTGAAGGG
TTCGTTCGCA TTGAGGAGTC CGATATGTAC CTGCGGCCGG ACCCGGATAC CTTTGTTGTT
TTTCCCTGGC GGCCCCATGA GGGTTCAGTG GCCCGGCTGA TCTGTGATGT CTACAACCCC
GACGGCACGC CCTTCGCCGG TTGCCCCCGC TCCACCTTAA AAAGGGTAAT GGCCGAAGCG
GCGGAAATGG GCTTTACCAT GAATGCCGGG CCGGAGGCCG AGTTTTTCCT CTTCCACACC
GACGCCGACG GCCGCCCGAC CCTGGAGACC CAGGACCGCG CCGGTTACTT TGACCTGACT
CCGGTGGATC TGGGTGAAGA CGCTCGCCGG GATATGGTCC TGACCCTGGA GCAAATGGGC
TTTGAGATCG AGGCCTCCCA CCATGAAGTG GCTCCCGGGC AGCACGAGAT TGATTTCAAA
TATGCCGAGG CCCTGACTAC CGCCGACCGG ATCGCTACCT TTAAATTTGT GGTCCGGACC
ATCGCCCAGC GGCATGGCCT CCACGCCACC TTTATGCCCA AACCCATCTA CGGCATCAAC
GGTTCCGGTA TGCATGCCAA CCTTTCCTTA TCTAAAGACG GTAAAAACGC TTTTGACGAT
CCCTCTGATG AACTGGGCTT AAGCCAGGTG GCCTACCATT TTATCGCCGG CATCATGGCC
CATGCCCGTG CCCTGACGGC CGTTACCAAC CCCACGGTTA ACTCCTACAA GCGCCTGGTG
CCGGGTTACG AGGCGCCGGT ATACATCGCC TGGTCGCCGC GCAACCGGAG CCCCTTGATC
CGGGTACCGG CCAAGCGGGG GGCCTCCACC CGGATTGAAG TGCGTCACCC GGACCCCTCC
TGCAACCCCT ACCTGGCCCT GGCTGTTCTC TTGAAGGCCG GTCTCGATGG TATCAAAAAG
GGCCTGACAC CACCGCCGCC GACGGATAAA AACATCTTTG CTATGACCCC GGCGGAGCTT
AAGGCAGAAG GAATCGGCGT CCTGCCCGGC AGCCTGGAGG AGGCCCTGGC GGCCTTGGAG
CAAGATGAAG TCATCCGCGA GGCCCTGGGA CCCCATATCT ACGAACGCTT GACCCTGGCT
CAAAAGATGG AATGCGAGGA GTATCGCACC CGGGTCCACC AGTGGGAGAT TGACCAGTAC
TTGACTAAAT TTTAA
 
Protein sequence
MDDKEKRAAV LQQAEEWGVK FVRLQFTDIF GVLKNVAIPV DQLPKALNNE LMFDGSSIEG 
FVRIEESDMY LRPDPDTFVV FPWRPHEGSV ARLICDVYNP DGTPFAGCPR STLKRVMAEA
AEMGFTMNAG PEAEFFLFHT DADGRPTLET QDRAGYFDLT PVDLGEDARR DMVLTLEQMG
FEIEASHHEV APGQHEIDFK YAEALTTADR IATFKFVVRT IAQRHGLHAT FMPKPIYGIN
GSGMHANLSL SKDGKNAFDD PSDELGLSQV AYHFIAGIMA HARALTAVTN PTVNSYKRLV
PGYEAPVYIA WSPRNRSPLI RVPAKRGAST RIEVRHPDPS CNPYLALAVL LKAGLDGIKK
GLTPPPPTDK NIFAMTPAEL KAEGIGVLPG SLEEALAALE QDEVIREALG PHIYERLTLA
QKMECEEYRT RVHQWEIDQY LTKF
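
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only: it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in its permitted start codons, and this gene starts with ATG, so no special handling is applied). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and whitespace is stripped because the sequences are displayed in blocks of ten.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence above.
print(translate("ATGGATGACA AAGAAAAGAG AGCAGCGGTC"))  # MDDKEKRAAV
print(translate("TTGACTAAAT TTTAA"))                  # LTKF
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_fraction` and comparing with the listed GC content, would be a simple integrity check on a downloaded record.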