Gene Information
Locus tag | Moth_1583 |
Symbol | |
ID | 3832088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1617514 |
End bp | 1618857 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829512 |
Product | VWA containing CoxE-like |
Protein accession | YP_430432 |
Protein GI | 83590423 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
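
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1617514
END_BP = 1618857


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming the stop codon is part of the CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


gene_len = gene_length_bp(START_BP, END_BP)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # 1344 447
```

Under those assumptions, the computed values agree with the Gene Length (1344 bp) and Protein Length (447 aa) fields.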
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000025338 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.503012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATACAA TTCCGGAAGT CCTGCAGGAG ATAGCTACCT TATTGCGCGC CGGGGGTATA GAGGTGACCC TGGCGGAAAT GGTGGATGCT GTACGGGGCT GGCAGACCAC CCCGGGGGCT TACTGGCCGG AGGTTTTGCA GGTAACCCTG GTCAAGAGAT ACGAGGATTT GAAGCCCCTG GCGGCCTTGC TCTCCCTCAC CGGCAGCCAT GGCTGGCCGG AACAGCCTGC CTGTAAGAAC AGGGGTATGG CTTCCGGGCA GGCTACGCGG CACGCAGGCG GGCAGTTGCC GGTCCAGGAC GTAATGGCAG CTCTTTTTAA TGGTTCTGAT AAAGACCTCA ACGACCTGGC GGAGCAGGCC ATCGACCTGC TGGGAGAATT GAATGCCGAA GCTCTGGACC ACCTGGAGGG TAAGGTAAGG GAAGCCAGGT TGGCTTTAAA CTGGCATATG GTCCGCCACA GGTTAAGGGT TATGGAAAGC GAGGGGCGGC AGGAGGCCCG GGAGGCCCAA CAGAGGTTGC AGCTATTGGC AGCGATTATT CGCCGGAACC TGGAGTTGCG CCTGGTGCAA GAGTTTGGTC CTGAAGCCAT GCTGGCTATT CTCCGTACCT ACAATCTGGC CGAAAAAGGA TGGAGCGAGC TGGCTGAAGG GGATCTGGCC GTTATGCGAC CGTATTTAAA GAAACTGGGT CGCTATCTGG GAAATAAGTA TTCCTGGCGT TACCGGCCCG CTCACCGGGG GAAGATCGAC CTGCGCCGCA CTGTTAAAGA AGCCTGCCGG CACGGCGGCG TTCCCTGGCA ACTCCGCTAC CGCGACCGTC GCCGGGAGAG GCCGGTCCTT TTTGTCCTGG GCGACATTTC AGGTTCGGTA GCGCCCTTTA GCGTCTTTAT GCTGGAATTG ATCTACGCCA TGCAGCATGC CTTTCGCCAG GTCCGGACCT TTGTTTTTGT TGATGACCTG GCTGAAGTTA CCAACGCCAT CAGGGAATCG CAGGACGCCG GGGCCATGGA GCAAGTGGCC CGTTTCGCCC GCTGCTCGGT TAGCGGTTAC TCCGACTTCG GGCGTGTCTT CAAGCTGTTT CTGGAACGCT ATGGAGAGGT TTTAACCCCT GAGACTACAA TTTTAATCCT GGGCGACGCG CGCAATAACT GGCGGCAACC GGAGGTCGAT AGTTTTGCAG CTATTTGCCG TAAAGCAGGA AAGGTAGTAT GGCTGAACCC GCAACCGGAA GCCTCCTGGA ATACCATGGA TAGCTCCATG GCTCTTTATG CCCCCTTCTG CCACGCTGTC AGGGAATGCT CGAATCTAAA GCAGCTAATT GCCATTGCCA GGGAGGGGCT TTAA
Protein sequence | MDTIPEVLQE IATLLRAGGI EVTLAEMVDA VRGWQTTPGA YWPEVLQVTL VKRYEDLKPL AALLSLTGSH GWPEQPACKN RGMASGQATR HAGGQLPVQD VMAALFNGSD KDLNDLAEQA IDLLGELNAE ALDHLEGKVR EARLALNWHM VRHRLRVMES EGRQEAREAQ QRLQLLAAII RRNLELRLVQ EFGPEAMLAI LRTYNLAEKG WSELAEGDLA VMRPYLKKLG RYLGNKYSWR YRPAHRGKID LRRTVKEACR HGGVPWQLRY RDRRRERPVL FVLGDISGSV APFSVFMLEL IYAMQHAFRQ VRTFVFVDDL AEVTNAIRES QDAGAMEQVA RFARCSVSGY SDFGRVFKLF LERYGEVLTP ETTILILGDA RNNWRQPEVD SFAAICRKAG KVVWLNPQPE ASWNTMDSSM ALYAPFCHAV RECSNLKQLI AIAREGL
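
The relationship between the two sequences can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments of NCBI translation table 11, which match the standard code. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGATACAA TTCCGGAAGT CCTGCAGGAG"
print(translate(first_blocks))  # MDTIPEVLQE
```

The translated prefix matches the first block of the protein sequence, and passing the full gene sequence to `translate` should reproduce the rest of the protein in the same way.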