Gene Moth_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1139 
Symbol 
ID3833237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1167806 
End bp1170535 
Gene Length2730 bp 
Protein Length909 aa 
Translation table11 
GC content54% 
IMG OID637829069 
Producthypothetical protein 
Protein accessionYP_429996 
Protein GI83589987 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.827602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAA ATCGTTTATG GTTCTGTCTT TTAATTATAA TCCCGGGCTT CCTGGTCGCA 
GCCTACCTGG GCTCCCACTT CCTGACCGAC TGGTACTGGT TTGCCGAAGT GGGCTACCGC
CAGGTCTTCC TTACCCGGTT GCTATCGGAA GTGGGAATAC GCCTGGGGAC TATAGCCTTT
TTCTTCCTTT TCTTTTATCT AAACCTCCTT TTTACCCGTA AAAGCCTCCA CCTATCGCCT
CCTGAAGGGC GGGAAAATTG GACTTTAAAG GAATACCTCA TCGACCGTTT TATTACCAGC
AGGCGTTTGG GTATCCTGTA TCTTTTACTA AGCCTGGCGG GGGCCCTCAT CTTCAGCCCC
CTGGCCGCCG GTAAGTGGCT GGTGGTCCAG GAGTACCTCA GGGCCACCCC CTTCGGCCTT
GCCGATCCCC TTTTTGGCAG GGATGTTAGT TTCTATATCT TTAAACTGCC CCTTTACCAT
TTTCTCTATA AATTGCTGAT AACGGCTGTG GTTGGGGCCG TCCTGGTCAC CGGCTTCTTC
TACTTCATCT TTAACCCGCG GGAGCTCCTG GGCCTGCGGC GGGGCCATTT TTCCCGTCCT
CTGGTCCATT TTTCCACCCT GGTAGCCCTA CTCTTTCTAA TTCAAGCCTG GGGGTTCCGC
TTGCAGGCCC TTGATCTGGT TCGATCATCC CGGGGTGTGG CCTTCGGCGC CAGCTATACC
GATATCCACG CCCTCCTTCC TGGCTATAAC ATCCTGGGAT GGGTAGCCGT AGCCTGCGGC
CTGATTATTG TCCTTAACGC CTTTCGCCGT AACCTAAAGC TAGTCAGTGC CGGTATTCTA
TCCTTTATGG CCGCCTATTT CCTGCTGGTG ATAGCTGTAC CCCTGGCAGT GCAAAAATTC
CAGGTCGAGC CCAACGAGTT CGCCCGGGAG GAGCCCTACC TGCGCTATAA TATTAACTTT
ACCCGCCGGG CCTATGGACT GGACAGGATC ACCATCCAGG AATTCCCGGC TCTGGACAAC
TTGACCCCCG CCAGTCTGAG GGAAGAAGGG GCCACCCTGG ACAACATCCG CCTGTGGGAT
TACCGACCCC TGGAGCAAAC CTACAGCCAG CTCCAGGAGA TCCGTTCTTA TTATAGTTTT
AAGGATATCG ATGTCGACCG CTACACCCTG GATGGCAGGG AGCGGCAGGT CATGCTGGCG
GCCCGGGAAC TGGACCAGAA CAAATTGCCT GACCGGGCCC GGACGTGGAT CAACGAAAAA
ATGCGCTATA CCCACGGCTA CGGTCTGGCC ATGAACCCGG CCAACACCGT TACTGCCGGC
GGCCAGCCGG AGTTTATCGC CGGCGACCTG CCCTTTCACA GCAGTGCCGG CCTCCAGGTT
AATGAGCCCC GGATCTATTA CGGTGAACTG ACCGGTGATT ATGTCATTAC CGGCGGTACG
GCAGCTGAAT TTGATTACCC TGTCACGGGA GAAGATAATT TCGTTGAAAC CCGATATCAG
GGAAGAGGCG GGGTTCCCAT TAATACTCCC TGGCGGCGGC TGGTCTTCGC TTTTCGCTTT
CACGATTACC GGCTGCTGAT GAGCAACGGG CTAACTCCCC AGAGTAAGAT ACTCTATTAC
CGCAATATCC AGGAACGGGT GCGGAAGATC ATGCCCTATT TGCGCTATGA CGCTGATCCC
TACCTGGTTG TTGCCGGGGG CCGCCTTTAC TGGTTCCTAG ACGCCTATAC CATTACTAAT
ATGTATCCCT ATTCCGAACC AAACAGCGGC GGCTTCAATT ATATTCGCAA CTCTGTCAAG
GTGGTTATCG ATGCTTATAA CGGTAGTGTT GACTACTATC TTGTTGACCC GGGAGACCCC
CTGGCCCAAA CCCTGGCCAG AATTTTCCCC GGGCTATTCA AGCCCCGGGA AGACATGCCG
GCCGGGCTGC AGCAGCACCT GCGTTATCCC CCCGACCTTT TAAGCATCCA GGCCCAGATG
TTGACTAACT ACCATATGGA AAACACCATG CTTTTTTATA ACAAAGAGGA TGCCTGGAGT
ATAGCTGAGG AAATGGTCGG CGATAAACGC CAGGCCATGG ATCCCTATTA CACCCTGATG
CGTTTACCCG GGGAAACACA GGCGGAGTAT ATCTTAATGC TTCCCTTTAC ACCAGCCCGT
AAGGTCAATA TGATAGCCTG GCTAGCAGCC CGCAATGATG GTCCCCATTA CGGCCAGCTT
TTGTTATACC AGTTCCCTAA AAACCGCTCT ATTTATGGCC CTATGCAGGT CGAAGCCCGT
ATCGACCAGG AACCGCGTAT CTCCCAGCAG CTAACCCTCT GGGACCAGCA TGGCTCCCAG
GTCATCCGGG GGAATCTTCT GGTAATCCCC ATTAAAGGCT CCCTGCTTTA TGTAGAGCCA
ATCTTCCTCC AGGCCCAGGA AAGTAAATTA CCTGAACTGC GCCAGGTGGT AGTTGCTTAC
GAAGAAAAAA TAGCCATGGC CGACACCCTG GCCGGGGCCC TGCAGGTCAT CTTCGGTACC
CAGACACCTG CACCCGCCGC CAGTCCTCAA CCGCCATCCC AGGCTGCAAC AGGCAGCCCA
GGTAACCTGT CGGAACTCAT CAAAGAAGCC AACCGCCTCT ATAGCGAAGC CCAGGACAGG
CTAAAACAGG GTGATTGGGC CGGTTACGGG GAAAACCTGA AAAAGCTGGA GCAGGTCCTC
CAGGAGATGG GACAAAAAGT AGCTGAATGA
 
Protein sequence
MKLNRLWFCL LIIIPGFLVA AYLGSHFLTD WYWFAEVGYR QVFLTRLLSE VGIRLGTIAF 
FFLFFYLNLL FTRKSLHLSP PEGRENWTLK EYLIDRFITS RRLGILYLLL SLAGALIFSP
LAAGKWLVVQ EYLRATPFGL ADPLFGRDVS FYIFKLPLYH FLYKLLITAV VGAVLVTGFF
YFIFNPRELL GLRRGHFSRP LVHFSTLVAL LFLIQAWGFR LQALDLVRSS RGVAFGASYT
DIHALLPGYN ILGWVAVACG LIIVLNAFRR NLKLVSAGIL SFMAAYFLLV IAVPLAVQKF
QVEPNEFARE EPYLRYNINF TRRAYGLDRI TIQEFPALDN LTPASLREEG ATLDNIRLWD
YRPLEQTYSQ LQEIRSYYSF KDIDVDRYTL DGRERQVMLA ARELDQNKLP DRARTWINEK
MRYTHGYGLA MNPANTVTAG GQPEFIAGDL PFHSSAGLQV NEPRIYYGEL TGDYVITGGT
AAEFDYPVTG EDNFVETRYQ GRGGVPINTP WRRLVFAFRF HDYRLLMSNG LTPQSKILYY
RNIQERVRKI MPYLRYDADP YLVVAGGRLY WFLDAYTITN MYPYSEPNSG GFNYIRNSVK
VVIDAYNGSV DYYLVDPGDP LAQTLARIFP GLFKPREDMP AGLQQHLRYP PDLLSIQAQM
LTNYHMENTM LFYNKEDAWS IAEEMVGDKR QAMDPYYTLM RLPGETQAEY ILMLPFTPAR
KVNMIAWLAA RNDGPHYGQL LLYQFPKNRS IYGPMQVEAR IDQEPRISQQ LTLWDQHGSQ
VIRGNLLVIP IKGSLLYVEP IFLQAQESKL PELRQVVVAY EEKIAMADTL AGALQVIFGT
QTPAPAASPQ PPSQAATGSP GNLSELIKEA NRLYSEAQDR LKQGDWAGYG ENLKKLEQVL
QEMGQKVAE