Gene Moth_1878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1878 
Symbol 
ID3831222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1941224 
End bp1942249 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content51% 
IMG OID637829810 
Producthypothetical protein 
Protein accessionYP_430721 
Protein GI83590712 
COG category[S] Function unknown 
COG ID[COG2855] Predicted membrane protein 
TIGRFAM ID[TIGR00698] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000395194 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACAA GGAATCCAAC AATGACTGAA GGGAACCAGG GCTTGGGTCT TATCCAGGGG 
GTGGGCCTGA CTGTAATTCT TACCCTGGTT GCCAGGCAGC TTGCCATGTT ACCGATTCTA
AAAATCATGG GCAGCATGGT TCTTGCTATC CTTCTGGGTG TTGCCTGGCG TTCCCTTATG
GACATTCCAG CAACGGCAGA GGTGGGCATT AATTTCGCCA GCAAAAAAAT TCTCCGTTAT
GGCATTATCC TCATGGGACT GCGTCTGGAT ATCCCTAAAA TTATTGCTGC CGGCCCGCAG
GTAATTCTCC TTGACATCCT GGCCATCTTA GTGTCTATGG TAGTAATTAT TTTCCTGGGG
CAGAGGATGG GGCTTAATAA AAAATTAGCC GCCCTCATAG CTGCCGGGAC GGGTATTTGC
GGGGCAGCAG CCATTGCCGC CATAGCCCCG ATAGTCAGGT CCCGGGATGA TGAAACTGCC
GTGGCGGTGG CTATTGTCGC CCTGCTGGGA ACCCTATTTA CGATTCTTTA CACCCTGCTT
TACCCGGTAC TTAATTTAAC TTCCTTCCAG TATGGTTTAT TATCCGGCAG CAGCCTACAT
GAACTGGCCC ATGTGATTGC GGCAGCCCAG GCCGGGGGCA GCGCCAGCGC TGATATCGCT
ATCCTGGTAA AACTAGGGCG AGTGGCCTTC CTGGTGCCGG TAGCTCTTGT GCTAGGATTA
ATTTTTGCTC GTCAAAATGA AACTGGCGCC GGCTGGCATT GGCGCCAGCT CCAGGTGCCA
TGGTTCATTT TGGGTTTCCT GGTTTTTAGC GGTATCAACA CCATGGCTAT TCTGTCAACT
CCCCTTATCG CATTTTTGAT CCAGGTCGGT GTTTTTCTCC TGACCGTGGC TATGGCCGGC
CTGGGCCTTA ACGTAAGCCT GGAAATGATC AAAAAGGTCG GCAGCCGGGG CCTGGTAACC
GGTTTGCTGG GTTCCGTTGT CCTTAGCTTA ACTATCTTTC TGGTTATTGC CAGTTTGATT
AATTAA
 
Protein sequence
MSTRNPTMTE GNQGLGLIQG VGLTVILTLV ARQLAMLPIL KIMGSMVLAI LLGVAWRSLM 
DIPATAEVGI NFASKKILRY GIILMGLRLD IPKIIAAGPQ VILLDILAIL VSMVVIIFLG
QRMGLNKKLA ALIAAGTGIC GAAAIAAIAP IVRSRDDETA VAVAIVALLG TLFTILYTLL
YPVLNLTSFQ YGLLSGSSLH ELAHVIAAAQ AGGSASADIA ILVKLGRVAF LVPVALVLGL
IFARQNETGA GWHWRQLQVP WFILGFLVFS GINTMAILST PLIAFLIQVG VFLLTVAMAG
LGLNVSLEMI KKVGSRGLVT GLLGSVVLSL TIFLVIASLI N