Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0997 |
Symbol | |
ID | 3830873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1024951 |
End bp | 1026546 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828926 |
Product | integral membrane protein MviN |
Protein accession | YP_429855 |
Protein GI | 83589846 |
COG category | [R] General function prediction only |
COG ID | [COG0728] Uncharacterized membrane protein, putative virulence factor |
TIGRFAM ID | [TIGR01695] integral membrane protein MviN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000225654 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00636735 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAGAAC GCAGGACGAT GGGACTGGCC CGGTCGGCGG CAATCCTTAG CCTGGCCTCG GCCTTTTCTC GTATCCTCGG TTTTCTTCGT AACACGGCCA TATCGGCCCT CTTCGGCCAA AACCGGCTGA CTGACATGTT AAACACCTCC TTCGTGATCC CAGATACTAT TTATTTAATC CTGGTGGGCG GGGGAGTAAG TTCAGCCTTT ATCCCGGTTT TGTCAAGCTA CCTGGCCGAG CAGGACGAAG ACGCCGTCTG GCAGACGGTG AGTATTGCCT TTAACCTGGT CCTGGCCGTG GTAGGCCTGG CCGTTATCCT GGGGATGATC TGGACCCCTA ACCTTGTTCA CCTGGTGGCC CCGGGCTTCA CCCCAGACCA GGTTGCCTAC ACCGCCTACT TGACCAGGAT CGTCCTGGTG GCCATCCTCT TCCACTGCCT TAACGGGGTT TTAATCGGTA CGGAATACGC CTACCAGTCC TTTATCGGCA CGGCCATCGG TCCCCTGGTC TATAATGCCG CCATCATCGT CTTCGGGCTG GCCCTGGCCG GGAGGTATAG TATTGCCGCC TTCGCCTTCG CCACCCTGAT TGGCGCTTTC CTGAATTTCC TGGTGCAGGT GTGGGGAATC TGGCGTCTAC GCCCCCGCTT TTCCCTGGTG CTGGACTTGA AGAATCCCGG CATCCGTAAG ATATTCAAGT TAATGTTGCC CGTGACCGTA GGTCTCTCTA TTGCCCAGCT CAACCTTTTC TTTAATCAAA CCTTTATTGC CTCCTACCTG CCCCGGGGTT CGATTAACGC CCTGACCATT TCCAGCCGGG TGGTACTGGT GCCCATCCTC TTTGCTTCCT CCATCGGCAT CACCCTGTTG CCGGCCCTCA CCCGGATGTA CCTGGAAGGG GATCAGGCCG CCTTTACCCG TTACCTCTCC GGTTCCCTGC GGGCCGTGCT CTTTATTTCC ATCCCGGCGA CTGTGGGTCT GGTGGTCCTG GGCCAGCCGG TGATCAGGGT CCTCTTCCAG CACGGCAACT TTACCAGCGC CGATACCATG GCTACCACCG AGGCCCTGGT ATTTTACTCC CTGGGCATCA GCGCTTACGG CACCTACGAG ATCTTGAGCC GCGCCTTTTA CGCCATTAAG GATACCGTCA CGCCCCTCTG GATCGGCCTG ATCACCCTGG CGGCAGGTAC GGCTTTGAAT TTTACCCTGG GACCGGCCTT CGGCATTCGC GGTCTGGCCC TGGCCTACTC CCTGGCCGGT TTTATCAACG TTTCCCTGTT GTTCTACTAC CTGCAGGTTA AAGCCCGGGC CCGTTTTGAA GGCCGCCGGA TGGTACAGAC GGCCGCCAAG AGCCTCCTGG CAGCCCTCGT GATGGGCCTC TTGCTGGTAC TAATATCCCG CCACCTGGCC CTGCCCGCCG CCTGGCCCCG CCTGGTGCGG GAGGGTCTGG AATTGTCCCT GATGATAACC CTGGGAGCAG GAAGCTATTG CCTCCTGGCC TGGCTATTGC GCATGGAAGA ACTGGTATCC TTCCTGAACA TCCTGGGCCG CCGGCTGCAA CGTTCCCGGC CGGCAACCGG TTATAAGAAG GAGTGA
|
Protein sequence | MTERRTMGLA RSAAILSLAS AFSRILGFLR NTAISALFGQ NRLTDMLNTS FVIPDTIYLI LVGGGVSSAF IPVLSSYLAE QDEDAVWQTV SIAFNLVLAV VGLAVILGMI WTPNLVHLVA PGFTPDQVAY TAYLTRIVLV AILFHCLNGV LIGTEYAYQS FIGTAIGPLV YNAAIIVFGL ALAGRYSIAA FAFATLIGAF LNFLVQVWGI WRLRPRFSLV LDLKNPGIRK IFKLMLPVTV GLSIAQLNLF FNQTFIASYL PRGSINALTI SSRVVLVPIL FASSIGITLL PALTRMYLEG DQAAFTRYLS GSLRAVLFIS IPATVGLVVL GQPVIRVLFQ HGNFTSADTM ATTEALVFYS LGISAYGTYE ILSRAFYAIK DTVTPLWIGL ITLAAGTALN FTLGPAFGIR GLALAYSLAG FINVSLLFYY LQVKARARFE GRRMVQTAAK SLLAALVMGL LLVLISRHLA LPAAWPRLVR EGLELSLMIT LGAGSYCLLA WLLRMEELVS FLNILGRRLQ RSRPATGYKK E
|
| |