Gene Moth_0216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0216 
Symbol 
ID3831367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp212690 
End bp214291 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content63% 
IMG OID637828152 
Productintegral membrane protein MviN 
Protein accessionYP_429094 
Protein GI83589085 
COG category[R] General function prediction only 
COG ID[COG0728] Uncharacterized membrane protein, putative virulence factor 
TIGRFAM ID[TIGR01695] integral membrane protein MviN 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTCG CGAATGATGG CAAGCCAGCC CGGGGTCCTG AGGGCACGCC GGTAGCCCGG 
ATGGCCCGGG CGGCGAGTGT TGTATTGGTA TTGAACCTTT TGAGCCGGGT GCTGGGCTTT
GTCCGGGATG CCAGTATTGC CGCCCGCTTC GGCGCCGGGC CGGCCACCGA TGCTTACCTG
GTGGCCTACA CCATCCCCTT TTTCCTGCAA ACCATCCTGG GGATGGCCTT TGTGACGGTG
ATGGTGCCGG TGGTCACTAC TTACCTGGTG CGGGGCGACC GCGACCAGGG GTGGGCGGTA
GCCAGCGCCG TGGGCAACTG GACGGCCCTG ATCCTGGGAT TGCTGACCAT TGTGGGCCTT
GGGGTGGCGC CCTGGCTGGT CCGGCTCATG GCGCCGGGTT TTCCGGCGCC GGTCTTCGAT
CTGGCTGTCA AGCTGACCCG GATTATGTTC CTCTCCCTGG CCTTTATGGG TACAGGCATG
CTGGTCAGCG GTATTTTAAA CGCCGGTTAT ATCTTTACTT CCCCGGCCCT GGCGCCGGCA
GTGAGCAACC TGGTGATTAT TGCCACGGTG ATCTTTGCCG GGTCAGCCTT TGGTATCACC
GGACTGGCGG TGGGTACTGT CCTGAGCTTC GTGGCTTACC TGTTAATCCA GCTTCCCGAC
CTGCCGCGCC TGCAGTTCCA CTACACCTGC AGCCTCATGG CAGGTCACCC GGCAGTGCGG
AGAATCGGCC GGCACCTCCT GCCGGTATGT TTCAGTTTGG CGGTAGTCCA GCTCTACCTG
GCGACGAACC GCTTTTTTGC TTCCCAGCTG GAGCCCGGGA GTATTACCGC CCTGGACTTT
GCCAACCGCC TGGTGAACCT GCCCCTGGGG GTCTTTGTCG CCAGCGTGAC CACCGCCATC
TTTCCCTCCC TGGCCGAGCA GGCGGCCCTT AATGACCGCC GGGAAATGGC CCACCTGACG
GACCGCGGCC TGGGGCTGGT GGCTCTGACT ATTTTGCCGG CGGCGGTCGG GATGATTGTC
CTGCGGGTGC CCCTGGTGCA ATTGGTCTTC CAGCGCGGGG CCTTCGATCC CCGGGCTACG
GCCATGACGG CTGTGGCGGT ACTCTTTTAT TCTGTTGGCC TCCTGGCTCA GGCCATGCAT
CCCATCCTTA CCCGGGCCTT TTACGCCCTC CAGGATGTGG TTGTCCCGGT GGTTACAGGT
ATTATTTCCG TTGGCCTGAA CATCCTCCTT AGCTATTTCC TGGCCCCGCG CCTGGGGCAC
GGCGGCCTGG CCCTGGCCAA TTCCCTGGCG GCCAGCATCT ACGCCCTGAT GCTTTACCTG
GCCCTCTACC GGCGCCTGCC GGAGTTAAAA GTAACCTTGC TCTTAAGTAC CATGTTGCGG
ATTTTCCTGG CGGCCATGGG CATGGGACTC CTGGTCTGGC TGGCCGGAAG AGGCCTGCAT
GTTTTCACCT GGTCGCGGCC CCTCCTGGGG CTCCTCGTGC GGATGACGTT GTTAATGGGG
GGCGGGGGCC TGGCTTTCTG GGTTCTGGCC CGGTGGCTGA AGGTAGAGGA AGTGACCTTC
ATCACCGCCA TGATCCGGCG GCGCCTGGAG CGAGTTTTCT AA
 
Protein sequence
MAVANDGKPA RGPEGTPVAR MARAASVVLV LNLLSRVLGF VRDASIAARF GAGPATDAYL 
VAYTIPFFLQ TILGMAFVTV MVPVVTTYLV RGDRDQGWAV ASAVGNWTAL ILGLLTIVGL
GVAPWLVRLM APGFPAPVFD LAVKLTRIMF LSLAFMGTGM LVSGILNAGY IFTSPALAPA
VSNLVIIATV IFAGSAFGIT GLAVGTVLSF VAYLLIQLPD LPRLQFHYTC SLMAGHPAVR
RIGRHLLPVC FSLAVVQLYL ATNRFFASQL EPGSITALDF ANRLVNLPLG VFVASVTTAI
FPSLAEQAAL NDRREMAHLT DRGLGLVALT ILPAAVGMIV LRVPLVQLVF QRGAFDPRAT
AMTAVAVLFY SVGLLAQAMH PILTRAFYAL QDVVVPVVTG IISVGLNILL SYFLAPRLGH
GGLALANSLA ASIYALMLYL ALYRRLPELK VTLLLSTMLR IFLAAMGMGL LVWLAGRGLH
VFTWSRPLLG LLVRMTLLMG GGGLAFWVLA RWLKVEEVTF ITAMIRRRLE RVF