Gene Information |
Locus tag | Moth_1835 |
Symbol | |
ID | 3832804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1892074 |
End bp | 1893207 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829765 |
Product | hypothetical protein |
Protein accession | YP_430678 |
Protein GI | 83590669 |
COG category | |
COG ID | |
TIGRFAM ID | |
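The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1892074
end_bp = 1893207


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count of a complete CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the Gene Length (1134 bp) and Protein Length (377 aa) fields listed above.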
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000242425 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGTCCGGG TTTTATTCTT GCGCCGGCGG TGGATGAAGC TGGCGGGTTC AGGCGTCATC CTCTTGGCCG GCATGGTCTT TTTCCTGCTT AATCAGAAAG GCCTGCCGGT AATAGCGCCT TCCGCCCCGG GGGAGTTGAC GGAGCACCTG ACAACCATTT TTACGGCCCG GGCCAGGGCC CTGGTCAACG GTAATTATGA AGGGCTGGAG GCTTTTTATG ATGCCACGAC GACCAGCGGC CGGTTTGCCC TGAACCATGA AATCGGCCGC ATTAAATACG TCCAGGAATG GTTGCAAAAA CGCCAGGTAA CCTTGACTGG CAGTCACCTG GACCTGGCCG TTGTCGACAG CGGTAGCGAA GGGGATAAGG GCTGGGCCTC GGTATCCCAG CATCTGATCC TCAGTTACCG GCACCAGGGG GAGCCGCAAG AAACAGTCAA CCGGATGGGG TTTCGTACCC TCCACTGGGT GGAGCTGGTC AAGCGGGACG GCCGCTGGCT GATCAACCGC GACTGGTACT GGGACCCTTT TGAAACCGAC GACCTGAAAC CAGAAATCGC CCCCGGCACG GCTGTATGCA AGGCGCTGCC GCCGCCGGTA AAGGGTAAAT ACCGCCGTGA GGCGGCGGTG GTCTATGCCG ACCGCTACAG CGGCGTGCGC CTGGGTCCCG GGGACGGCCG CTACAACCGG AATTACCGTG ACTTTACCGG CCTGGGCGGC GATTGTGCCA GCTTTGCCTC CCAGGTCTTG AGCGACAAAG AAGCCGGCGG CATACCCCGG GACTGGGTTT GGAATTACCA TAACGGCGAG GGCAGCCAGG CCTGGGCCCA GGCAGCCGCC CTGGTCTATT ACCTCCTGGA CAGCGGCCTG GCGGTGCGCC TGGCAAGAGG TGATTTCCAG GAAGTAACCC GGTCCACTTC TAATTACCCC TACGGGGCGG TCAACGCCCT GCAACCGGGT GACATCATCG GTTATGAAGA AGGGGGCGAG TTAAGCCATG TCTCGGTGGT TGTAGGCCGG GACTCGGCCG GATATGTCCT GGTCGACAGC CATACGGCCG ACCGTTACCA TGTCCCCTGG GACATGGGTT GGAAGAGCGG GACCATCTAC TGGCTCCTCC AGGTAGTCTA TTGA
Protein sequence | MVRVLFLRRR WMKLAGSGVI LLAGMVFFLL NQKGLPVIAP SAPGELTEHL TTIFTARARA LVNGNYEGLE AFYDATTTSG RFALNHEIGR IKYVQEWLQK RQVTLTGSHL DLAVVDSGSE GDKGWASVSQ HLILSYRHQG EPQETVNRMG FRTLHWVELV KRDGRWLINR DWYWDPFETD DLKPEIAPGT AVCKALPPPV KGKYRREAAV VYADRYSGVR LGPGDGRYNR NYRDFTGLGG DCASFASQVL SDKEAGGIPR DWVWNYHNGE GSQAWAQAAA LVYYLLDSGL AVRLARGDFQ EVTRSTSNYP YGAVNALQPG DIIGYEEGGE LSHVSVVVGR DSAGYVLVDS HTADRYHVPW DMGWKSGTIY WLLQVVY