Gene Moth_2504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2504 
Symbol 
ID3832776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2610075 
End bp2611487 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content59% 
IMG OID637830427 
Productselenocysteine synthase 
Protein accessionYP_431329 
Protein GI83591320 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1921] Selenocysteine synthase [seryl-tRNASer selenium transferase] 
TIGRFAM ID[TIGR00474] seryl-tRNA(sec) selenium transferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0196405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0690834 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCCA GGAACTTATT GCGGCAATTG CCGGCGGTAG ATCAATTATT ACAGCATCCC 
CGGCTAAAGG ATCTTAGCCG GGAAAACTAT AAAATGGTAC TGGCTCTTAC CCGCCAGGTC
CTGGATGATT GGCGCCTGAA GATAAAAAAC GGGGCTACCA CCATACCTGA TCCAGGTCAA
TTAGCACGGG AAATAGAAAA TAGATACCAT GAGGCAGGCC GCAGTAGTTT ACGGCCGGTA
ATCAACGCCA CCGGTGTGGT TTTACATACC AACCTGGGCC GGGCTATTTT AAGCCCGGCC
GCCCGGGCAG CAGCCCTGAC GGCAGCCGGG CGCTATACCA ACCTGGAATA TGATCTGGAG
AAGGGGCAGC GCGGCAATCG CTACAGCCAT GTAACAGGGC TGTTGAAGGA ACTTACAGGG
GCGGAAGAAG CCCTGGTGGT CAACAATAAT GCCGCCGCCG TCCTCCTGGC CCTGTCGACC
CTGGCGGCAG GACGGGAGAC CATTATTTCC CGGGGCCAGC TGGTGGAGAT CGGGGGTTCC
TTCCGAATAC CGGAGGTCAT GGGCCAGAGC GGTACCAGAC TGGTGGAAGT GGGTACGACT
AATAAAACCT ACATCCATGA TTATGAAAGA GCTGTGGGTC CGGATACGGC TCTACTGCTC
AAGGTCCATC CCAGTAACTA TCGTATCCAG GGGTTTACCC GGGAAGTCAC CACTGCTGAA
CTGGTGGAAC TGGGGCGCCG TGTGGGGGTG CCGGTCATGG AAGACCTGGG CAGCGGCTTT
CTTATCGACC TGGAGGCCTA TGGTATAACC GGAGAACCGA CAGTCCAGGC AGAAATAAAC
CAGGGAGTAG ACGTAGTAAC CTTCAGCGGC GATAAATTAC TGGGAGGACC CCAGGCGGGC
ATTATTGTCG GCCGCCGGGA CCTGGTAGCG GCCATGGCCG GCCATCCTCT CACCAGAGCC
CTGCGGATTG ATAAAATGAA CCTGGCCGCC CTGGAGGCGA CCTTGCGGGC TTACCGCAAT
CCGGACCGGG CCGTTAAAGA GATCCCCACC CTGGCGGCCC TGGTGGCTTT ACCGGAGGAC
CTGCGCCTCC GGGCCGAGGA GCTACAAAAA TTGCTGACCA GCGTACTTGG TTCCCGGGCC
AGGGTTGGGT TGATGCCTAC TACCTCTCAG GCCGGCGGCG GTTCCCTGCC TGTAACTGAA
CTACCATCCT GGGCCATAAC CATCCGTCCT GAACAAGGAG GAGCAGCCGG ACTGGTAACT
GCCCTCCGCC GGACGGACCC ACCGGTCCTG GCGCGGGTCC AGGACGACCT CCTGCTCCTT
GACGTCAGGA CTCTGTTGCC GGGCGAGGGC GAGGAACTCG CCCGGGCCCT GGTTCAGGCC
CTGGAGGGAG CCGTCCATGG TGGTGAGTCG TAA
 
Protein sequence
MESRNLLRQL PAVDQLLQHP RLKDLSRENY KMVLALTRQV LDDWRLKIKN GATTIPDPGQ 
LAREIENRYH EAGRSSLRPV INATGVVLHT NLGRAILSPA ARAAALTAAG RYTNLEYDLE
KGQRGNRYSH VTGLLKELTG AEEALVVNNN AAAVLLALST LAAGRETIIS RGQLVEIGGS
FRIPEVMGQS GTRLVEVGTT NKTYIHDYER AVGPDTALLL KVHPSNYRIQ GFTREVTTAE
LVELGRRVGV PVMEDLGSGF LIDLEAYGIT GEPTVQAEIN QGVDVVTFSG DKLLGGPQAG
IIVGRRDLVA AMAGHPLTRA LRIDKMNLAA LEATLRAYRN PDRAVKEIPT LAALVALPED
LRLRAEELQK LLTSVLGSRA RVGLMPTTSQ AGGGSLPVTE LPSWAITIRP EQGGAAGLVT
ALRRTDPPVL ARVQDDLLLL DVRTLLPGEG EELARALVQA LEGAVHGGES