Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2269 |
Symbol | |
ID | 3831380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2376428 |
End bp | 2377768 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637830189 |
Product | hypothetical protein |
Protein accession | YP_431099 |
Protein GI | 83591090 |
COG category | [S] Function unknown |
COG ID | [COG5441] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00139648 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAGCGA AGGACATTGC CATTATCGCC ACAGTAGACA CCAAAGAGGC CGAAGCCCGT TTTCTGCAGG AGTTTATCAC CAGTCATGGC TGGCAGGCCC CGGTGCTGGA TGTCAGCACT CATCGTCCCC ATAATTTTCA GGCGACCTAT TCCAGGGAAG AGATCTGCCG CCGGGCTGGG GTGGAGTACA AGGATTTGGG CACCTTGCGC CGGGATGCCA TGATGGCAAC CATGGGCCGC GGAGCGGCCC GGGTATTAAT GGAACTTTAT GACCGGGGAG AGCTGGCGGG CGTCCTGGGC ATCGGCGGCA ACCAGGGTAC GGCCATAGCA GCTATGGCCA TGCGCTCTTT GCCTGTCGGG CTGCCCAAGT TAATAGTTTC TACGGTGGCC TCGGGCAATG TCCGGCCCTA TGTAGAGTAC AAGGACATTA CCATGATGTT CTCAGTAGCC GACCTGCTGG GTGGTCCCAA CACCGTCAGC CGCACTATCC TCAGCAATGC TGCCGGGGCG GTGATAGGAA TGGCCGCCTG GGGCCAGCCC CTGAAGGCGG GGGAACGGCC GGTAATTGCC ATCACAGCTC TGGGCAATAC CGACCCGGCA GTAGCAGCCG CCCGGGGGCG ACTGGTGGAA CTGGGTTACG AAGTGATAGC CTTTCATGCT TCCGGGACCT GCGGATCAGC CATGGAAGAA CTAATCGAAG CAGGGTTAAT AAACGGCGTT CTGGATCTGA CCCCCCACGA GTTGATCGGC GAGGTCCATG GCGCTGATAT TTATACTCCC CTGCGGCCGC GCCTGGAAGC TGCAGGCAGG CGGGGGATTC CCCAAGTTGT TTCCTTGGGC GGCCTGGATT ACTTCTGTTT TGGACCGGCA GATACCATAC CGCAGCGTTT CCAAGGCCGG AAGACCCACT ACCATAACCC CTACAATACC AATGTCCGGG CTACCGGGGG TGAACTGGCC CAGGTAGGCG AAGTCATGGC CGCCAAGCTA AATGCCGCTC GCGGTCCGGT GGTGGTGATG GTCCCTCTCA AGGGCTGGTC GGAAAACGGC CGGGCCGGTG GCCCCCTGTA CGATCAGGAA GCCGACGCCG CTCTGGTGGC GTCCCTGGAG GCCAACCTGA ATCCCGGGAT AAAACTTATG AAACTCAACG CCCATATTAA CGACCCGATC TTCGCCGCCA GCGCCGTCGC TGTTTTGCAC CAGTTGATGG AGGTTTCTCG GCCGGTAGAT GGCACCTTTC CCAGGGAAGC CGTGGAGAAG GGCACACTCC CTCCAAAAAA CCCGAAATGG AGGCGATCGT TAACGCCAGA AAGCGCAATC GTAAAGCAAG CACCAAGGTG A
|
Protein sequence | MTAKDIAIIA TVDTKEAEAR FLQEFITSHG WQAPVLDVST HRPHNFQATY SREEICRRAG VEYKDLGTLR RDAMMATMGR GAARVLMELY DRGELAGVLG IGGNQGTAIA AMAMRSLPVG LPKLIVSTVA SGNVRPYVEY KDITMMFSVA DLLGGPNTVS RTILSNAAGA VIGMAAWGQP LKAGERPVIA ITALGNTDPA VAAARGRLVE LGYEVIAFHA SGTCGSAMEE LIEAGLINGV LDLTPHELIG EVHGADIYTP LRPRLEAAGR RGIPQVVSLG GLDYFCFGPA DTIPQRFQGR KTHYHNPYNT NVRATGGELA QVGEVMAAKL NAARGPVVVM VPLKGWSENG RAGGPLYDQE ADAALVASLE ANLNPGIKLM KLNAHINDPI FAASAVAVLH QLMEVSRPVD GTFPREAVEK GTLPPKNPKW RRSLTPESAI VKQAPR
|
| |