Gene Moth_0997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0997 
Symbol 
ID3830873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1024951 
End bp1026546 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content59% 
IMG OID637828926 
Productintegral membrane protein MviN 
Protein accessionYP_429855 
Protein GI83589846 
COG category[R] General function prediction only 
COG ID[COG0728] Uncharacterized membrane protein, putative virulence factor 
TIGRFAM ID[TIGR01695] integral membrane protein MviN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000225654 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00636735 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACAGAAC GCAGGACGAT GGGACTGGCC CGGTCGGCGG CAATCCTTAG CCTGGCCTCG 
GCCTTTTCTC GTATCCTCGG TTTTCTTCGT AACACGGCCA TATCGGCCCT CTTCGGCCAA
AACCGGCTGA CTGACATGTT AAACACCTCC TTCGTGATCC CAGATACTAT TTATTTAATC
CTGGTGGGCG GGGGAGTAAG TTCAGCCTTT ATCCCGGTTT TGTCAAGCTA CCTGGCCGAG
CAGGACGAAG ACGCCGTCTG GCAGACGGTG AGTATTGCCT TTAACCTGGT CCTGGCCGTG
GTAGGCCTGG CCGTTATCCT GGGGATGATC TGGACCCCTA ACCTTGTTCA CCTGGTGGCC
CCGGGCTTCA CCCCAGACCA GGTTGCCTAC ACCGCCTACT TGACCAGGAT CGTCCTGGTG
GCCATCCTCT TCCACTGCCT TAACGGGGTT TTAATCGGTA CGGAATACGC CTACCAGTCC
TTTATCGGCA CGGCCATCGG TCCCCTGGTC TATAATGCCG CCATCATCGT CTTCGGGCTG
GCCCTGGCCG GGAGGTATAG TATTGCCGCC TTCGCCTTCG CCACCCTGAT TGGCGCTTTC
CTGAATTTCC TGGTGCAGGT GTGGGGAATC TGGCGTCTAC GCCCCCGCTT TTCCCTGGTG
CTGGACTTGA AGAATCCCGG CATCCGTAAG ATATTCAAGT TAATGTTGCC CGTGACCGTA
GGTCTCTCTA TTGCCCAGCT CAACCTTTTC TTTAATCAAA CCTTTATTGC CTCCTACCTG
CCCCGGGGTT CGATTAACGC CCTGACCATT TCCAGCCGGG TGGTACTGGT GCCCATCCTC
TTTGCTTCCT CCATCGGCAT CACCCTGTTG CCGGCCCTCA CCCGGATGTA CCTGGAAGGG
GATCAGGCCG CCTTTACCCG TTACCTCTCC GGTTCCCTGC GGGCCGTGCT CTTTATTTCC
ATCCCGGCGA CTGTGGGTCT GGTGGTCCTG GGCCAGCCGG TGATCAGGGT CCTCTTCCAG
CACGGCAACT TTACCAGCGC CGATACCATG GCTACCACCG AGGCCCTGGT ATTTTACTCC
CTGGGCATCA GCGCTTACGG CACCTACGAG ATCTTGAGCC GCGCCTTTTA CGCCATTAAG
GATACCGTCA CGCCCCTCTG GATCGGCCTG ATCACCCTGG CGGCAGGTAC GGCTTTGAAT
TTTACCCTGG GACCGGCCTT CGGCATTCGC GGTCTGGCCC TGGCCTACTC CCTGGCCGGT
TTTATCAACG TTTCCCTGTT GTTCTACTAC CTGCAGGTTA AAGCCCGGGC CCGTTTTGAA
GGCCGCCGGA TGGTACAGAC GGCCGCCAAG AGCCTCCTGG CAGCCCTCGT GATGGGCCTC
TTGCTGGTAC TAATATCCCG CCACCTGGCC CTGCCCGCCG CCTGGCCCCG CCTGGTGCGG
GAGGGTCTGG AATTGTCCCT GATGATAACC CTGGGAGCAG GAAGCTATTG CCTCCTGGCC
TGGCTATTGC GCATGGAAGA ACTGGTATCC TTCCTGAACA TCCTGGGCCG CCGGCTGCAA
CGTTCCCGGC CGGCAACCGG TTATAAGAAG GAGTGA
 
Protein sequence
MTERRTMGLA RSAAILSLAS AFSRILGFLR NTAISALFGQ NRLTDMLNTS FVIPDTIYLI 
LVGGGVSSAF IPVLSSYLAE QDEDAVWQTV SIAFNLVLAV VGLAVILGMI WTPNLVHLVA
PGFTPDQVAY TAYLTRIVLV AILFHCLNGV LIGTEYAYQS FIGTAIGPLV YNAAIIVFGL
ALAGRYSIAA FAFATLIGAF LNFLVQVWGI WRLRPRFSLV LDLKNPGIRK IFKLMLPVTV
GLSIAQLNLF FNQTFIASYL PRGSINALTI SSRVVLVPIL FASSIGITLL PALTRMYLEG
DQAAFTRYLS GSLRAVLFIS IPATVGLVVL GQPVIRVLFQ HGNFTSADTM ATTEALVFYS
LGISAYGTYE ILSRAFYAIK DTVTPLWIGL ITLAAGTALN FTLGPAFGIR GLALAYSLAG
FINVSLLFYY LQVKARARFE GRRMVQTAAK SLLAALVMGL LLVLISRHLA LPAAWPRLVR
EGLELSLMIT LGAGSYCLLA WLLRMEELVS FLNILGRRLQ RSRPATGYKK E