Gene Moth_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1517 
Symbol 
ID3831982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1562704 
End bp1564095 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content60% 
IMG OID637829449 
Productputative oxidoreductase 
Protein accessionYP_430369 
Protein GI83590360 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR01316] glutamate synthase (NADPH), homotetrameric 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTAA TCCCTAAGAA GACCCCTATG CCTTCCCAGG AACCCAGGGA GCGGATTCAT 
AACTTCAACG AAGTAGCGCA GGGTTATACC CGGGAAATGG CCCTGGCTGA GGCTCAGCGT
TGCCTTCAGT GCAAAAAGGC CCCCTGCCGC CAGGGTTGCC CGGTAGAAGT AGATATACCG
GCCTTTATCG CCCGCTTGAA GGAGCAGGAC TTTGACGGCG CCATTGCCAA GATCAAAGAA
AAGAATAACC TGCCGGCCAT CTGCGGCCGG GTTTGCCCCC AGGAAAACCA GTGTGAAAAA
TTCTGCACCC TCGGCAAAAA ACACGAGCCA GTGGCCATCG GTCGCCTGGA GCGCTTCCTG
GCCGATTACC AGCTGGCCAA AGGCGAGACG GCCAGTAGCG AAAAGGCGCC ACCCAGCGGT
TACAAAGTGG CGGTCATCGG TTCCGGGCCG GCCGGCCTGA CGGCTGCTGC CGACCTGGCC
AGGATGGGAC ATCAGGTAAC AGTCTTTGAG GCCCTCCATG TACCCGGGGG CGTGCTCATG
TACGGTATTC CCGAGTTCCG GTTGCCGAAA AGGATTGTTC AACAGGAGAT AGATACTATT
CGCCGTCTTG GGGTGGAGAT CCGGACCAAT GCCGTGGTAG GCAAGCTGAC TACGGTGGAC
GAGTTGCTGG AGAACGGTTA CGACGCCGTC TTCATCGGTA CGGGGGCCGG GTTGCCCCAC
TTTATGGGGA TTCCCGGGGA GAACCTCCTG GGCGTTTACT CGGCCAATGA GTTTTTGACC
CGGACCAACC TGATGAAGGC CTACCTATTC CCCCGGTATG CTACTCCCAT CAAGGTCGGG
AAACGGGTGG CCGTCATCGG CGCCGGCAAT GTAGCCATGG ATGCCGCCCG TACAGCTTTG
CGCCTGGGAG CCGAGGAATC CTATATCGTT TACCGCCGTT CGGCAGCGGA GATGCCAGCC
CGCAAGGAAG AAGTCGAGCA CGCCGAGGAA GAAGGCGTCC AGTTCCGCCT CCTAACCAGT
CCTGTGCGTA TCCACGGCAA CGACCAGGGC GTAGTTACAG GCATGACCTG CCAGCGCTTC
GAACTGGGCG AACCCGATGC CTCCGGCCGG CGACGTCCGG TGCCCATTCC CGGTTCTGAG
TACGACATGG CGGTCGATAC CGTCGTCATC GCCATCGGCC AGGGACCTAA TCCCTTGGTC
CTGCGGACAA CTCCGGGCCT GAAGCTCACA AGCAAGGGAA CCATTGCCGC CGACGAGGCA
ACCGGTGCCA CTTCCCGTAA GGGCGTTTTT GCCGGCGGTG ACATCGTCAC CGGGGCGGCT
ACCGTCATCC TGGCCATGGG AGCCGGTAAA GCGGCCGCCC GGGCCATTGA CGCCTATCTA
CGGGAAAAAT AA
 
Protein sequence
MPLIPKKTPM PSQEPRERIH NFNEVAQGYT REMALAEAQR CLQCKKAPCR QGCPVEVDIP 
AFIARLKEQD FDGAIAKIKE KNNLPAICGR VCPQENQCEK FCTLGKKHEP VAIGRLERFL
ADYQLAKGET ASSEKAPPSG YKVAVIGSGP AGLTAAADLA RMGHQVTVFE ALHVPGGVLM
YGIPEFRLPK RIVQQEIDTI RRLGVEIRTN AVVGKLTTVD ELLENGYDAV FIGTGAGLPH
FMGIPGENLL GVYSANEFLT RTNLMKAYLF PRYATPIKVG KRVAVIGAGN VAMDAARTAL
RLGAEESYIV YRRSAAEMPA RKEEVEHAEE EGVQFRLLTS PVRIHGNDQG VVTGMTCQRF
ELGEPDASGR RRPVPIPGSE YDMAVDTVVI AIGQGPNPLV LRTTPGLKLT SKGTIAADEA
TGATSRKGVF AGGDIVTGAA TVILAMGAGK AAARAIDAYL REK