Gene Moth_0938 details


Gene Information

Locus tag: Moth_0938
Symbol:
ID: 3832939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 971736
End bp: 972959
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 60%
IMG OID: 637828869
Product: nucleoside recognition
Protein accession: YP_429798
Protein GI: 83589789
COG category: [S] Function unknown
COG ID: [COG3314] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR02871] sporulation integral membrane protein YlbJ
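
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the listed numbers are consistent with both assumptions. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 971736
end_bp = 972959
gene_length_bp = 1224
protein_length_aa = 407


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coded_residues(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coded_residues(gene_length_bp) == protein_length_aa)
```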


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000955347
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCAAC CTGTATTCAT AATTAGCCGG GGTATAACCC CCTTCCTAAC AGCAGTGGCC 
GTCGTTATCC TGGCCCTGGC TATCGTCCTT TTCCCCCAGC CCGTTTTTCA GGCTGCCCTG
CGCGGTCTCC GGGCCTGGTG GGAAATCGTC GTCCCGGCCC TTTTACCTTT TTTTATTATT
TCTCAATTAT TTATGGGCCT GGGTATCGTC CACTTTCTGG GCGTGCTCCT GGAGCCGGTT
ATGCGGCCTC TCTTCAATGT CCCGGGCAGC GGCGCCTTTG TCATGGCCAT GGGGTACACT
TCTGGCGCCC CCATCAGCGC CATCCTTACC TCCCAGTTAC GCCAGCAGCA GCTGGTAACC
AGGGTTGAAG GGGAACGCTT AATCTGTTTC ACCAACAACG CCAGCCCCCT TTTTATGCTG
GGGGCAGTAG CCGTGGGTAT GCTCCATAAC CCGGCCCTGG GCCCGGCCCT GGCGGGAGCC
CATTATGGAG CCAACCTCTT CCTGGGAGTC CTATTTCGCT TCTACGGTCG ACGGGCACCG
GCTTCACCGC CGGGGAACCA CCCCCTCCTA TCCCTGCCAC GGAGAGCTTG GCGGGCCATG
ATTCAGGCCC AACAAAGGGA TGGCCGTTCC CTCGGCCAGC TCCTTGGGGA TGCCGTGAGC
CACTCCTTCC AGACCCTGAT TACCATCGGC GGCTTTATAA CCCTCTTCAG CGTCATTATC
CAGGTAGCCG GTATGCTGGG TATCCTGGAC CTCCTGGCCA GGTTGCTGCT TTATGCCGGC
CATCCCCTGG GTTTAACCCC GGCAACAGCC GGGGCCCTGG CCAGCGGTAT CTTTGAAATG
ACCATGGGGA CCAAGTTTGC CAGTGAAGCT CCCGTACCCC TTGGGGAGCA GCTCACTGCT
ATTAGTATCA TCATGGGCTG GGCCGGGCTC TCCGTCCTGG GCCAGGTGGC TGCCATGACC
AGCAAAACGG ATCTCCGCCT GGGTCCCTTT ATCCTGGCCC GCCTTCTCCA TGGTTTCCTG
GCGGCCTTCA TGGTCCAACT CTTCCGGGGA CCAGCCCGGC CAGTCCTTGG TTGGCTGACA
GGTAGCCATT TCCTGTCGCC CCCGGTATCA TGGCTTTCCC TGGGGGTCCA CTATACAGGG
TTTACCCTCA CCCTGGCGGC CTTGTTATTG TTCCTGACCG TGCTGGGATT GTTCGCCCGC
CTGACCCTTT ACCGGCGGTT TTGA
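
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before the length or GC content fields can be recomputed. A minimal sketch, assuming plain A/C/G/T text as shown above; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10).
first_line = "ATGCGTCAAC CTGTATTCAT AATTAGCCGG GGTATAACCC CCTTCCTAAC AGCAGTGGCC"
seq = clean_sequence(first_line)
print(len(seq), gc_percent(seq))
```

Passing the whole gene sequence to the same two functions gives values that can be compared with the Gene Length and GC content fields.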
 
Protein sequence
MRQPVFIISR GITPFLTAVA VVILALAIVL FPQPVFQAAL RGLRAWWEIV VPALLPFFII 
SQLFMGLGIV HFLGVLLEPV MRPLFNVPGS GAFVMAMGYT SGAPISAILT SQLRQQQLVT
RVEGERLICF TNNASPLFML GAVAVGMLHN PALGPALAGA HYGANLFLGV LFRFYGRRAP
ASPPGNHPLL SLPRRAWRAM IQAQQRDGRS LGQLLGDAVS HSFQTLITIG GFITLFSVII
QVAGMLGILD LLARLLLYAG HPLGLTPATA GALASGIFEM TMGTKFASEA PVPLGEQLTA
ISIIMGWAGL SVLGQVAAMT SKTDLRLGPF ILARLLHGFL AAFMVQLFRG PARPVLGWLT
GSHFLSPPVS WLSLGVHYTG FTLTLAALLL FLTVLGLFAR LTLYRRF
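
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon-to-residue mapping from the usual TCAG-ordered amino acid string; translation table 11 uses the same codon assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. `translate` is a hypothetical helper, not a tool used by the source database.

```python
BASES = "TCAG"
# Residues for all 64 codons, ordered by first, second and third base (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()  # drop block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence; compare with the start of the protein.
print(translate("ATGCGTCAAC CTGTATTCAT AATTAGCCGG"))
```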