Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2225 |
Symbol | |
ID | 3830832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2319955 |
End bp | 2321154 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637830145 |
Product | hypothetical protein |
Protein accession | YP_431055 |
Protein GI | 83591046 |
COG category | [R] General function prediction only |
COG ID | [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.526626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000291502 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTGTTTA AAGTAAGACG GGAAGCGATC AACACCCTGG CGGTAACCGT GGCCCTGGTC CTGATTGTCA CCCTGGCCCT TTATCTCAGG TTAAAGTTCG TTTTTACCAT CGACCATCCA CCCTTGAAGG GCTTTTCCTC CGACGCTGTC AACTATGACC TCATGGCGCG CCAGTTCCTG GATAAGGGGT TTCTGGGCTA TATGTCCAGC CGGCCCAATG CCTATATCAC CCCGGGCTAT CCCCTGTTCC TGGCTTTGAT TTACAAACTC TACGGTTATG CCCAGGGGAG TCCCCTGCAG GCGGTGCGGG TGGTCCAGGC CTGCCTGGGC ACCCTGACGG TGGTGCTGTT GTACCTGGCG GGCCGGGAGG TAAAAAACAC CAGGGTGGGG CTGGTGGCCG CCCTGCTGGC GGCTATTTAC CCCACCTTTG TCTGGGCCCC GACTATCCCT TTAACCGAAG TAGTTTATAC CTTCTTTTTT ATGCTCTATT TTTACCTCCA GCTCCGGTAT TTACGCCATC CTTCCCCCCT GGGAGGCGTT TTAACGGGGC TGATTTTTGG CCTGGCCATC CTGGTTCGCC CGGCGGCGGC CCCCCTGATC GTGGTACCCT TCCTTTATGA TTTTTACCGG CGTAAGGAGT GGCGTTCCTC CCTCAAGGGT TTCCTGTACA CCCTGGGAGG GTTTGTGGCC GTCATGCTGC CCTGGTGGAT ACGCAACCTG GTAACCCTGC ACCAGTTCAT CCTCCTGGCC ACCCAGACCT GGAACCCCCT CCTTTACGGC GCCTTTCCTT ACTTCACAGA TATGGACAAG GTGCCTCCCA TCCAGTCCAC CCAGGAGGCC TTGCATTTTA TCCTCCGGGG CTTTTTAAGG AACCCGGTGT TGTACCTCAA GTGGTATACA ATCGGCAAGT GGCAGGTTAT CTTTGGCAAT ATGTGGTACG GTCTTGACCT TTCCCGCTAT CAGTACTTGC GTTCGGTTTA CTGGGTGCAC AATTTTATCA CCATGGTGGG CTGGTTGGGG TCTTTTAAGG CCCTCAAGGA GGGAAGAGTA GGCCTGGTGG CCATCTTTAT TTTTCTCCTG ACGGCCATCC AATTGATGTT TATCCCCACC GTCAGGTATG CCTTTACCAT CATGCCGTTC TTGATGCTCA CCACCGCTTG GCTTATGGAT CTCCTGTTCG GAGCGGAAGA GGCTGCTTGA
|
Protein sequence | MLFKVRREAI NTLAVTVALV LIVTLALYLR LKFVFTIDHP PLKGFSSDAV NYDLMARQFL DKGFLGYMSS RPNAYITPGY PLFLALIYKL YGYAQGSPLQ AVRVVQACLG TLTVVLLYLA GREVKNTRVG LVAALLAAIY PTFVWAPTIP LTEVVYTFFF MLYFYLQLRY LRHPSPLGGV LTGLIFGLAI LVRPAAAPLI VVPFLYDFYR RKEWRSSLKG FLYTLGGFVA VMLPWWIRNL VTLHQFILLA TQTWNPLLYG AFPYFTDMDK VPPIQSTQEA LHFILRGFLR NPVLYLKWYT IGKWQVIFGN MWYGLDLSRY QYLRSVYWVH NFITMVGWLG SFKALKEGRV GLVAIFIFLL TAIQLMFIPT VRYAFTIMPF LMLTTAWLMD LLFGAEEAA
|
| |