Gene Information |
Locus tag | Moth_0726 |
Symbol | |
ID | 3831002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 757587 |
End bp | 758879 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828657 |
Product | Type III effector Hrp-dependent outers |
Protein accession | YP_429587 |
Protein GI | 83589578 |
COG category | [S] Function unknown |
COG ID | [COG3395] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
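The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Moth_0726.
length = gene_length_bp(757587, 758879)
print(length)                     # 1293
print(protein_length_aa(length))  # 430
```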
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00533743 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.660892 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGA TATCCATCAT CGCCGACGAT TTAACCGGAG CCAACGATAC CGGCGTCCAG TTTTGCCAGC ACGGTTTCCG CACCATGGTT ATTATAGATG CTGCCAACGT AGAGCGGGTG GGGCAGGATA AAGATGTCTG GGCGATCAAT ACCGACACCC GCCACCTGGC AGCACCTGAA GCCTACCAGC GCGTTTATGA GATCACTTTA AAACTAAAAA AAGCTGCCAT CAGCCGGGTT TACAAAAAGA TTGATTCCAC CCTGCGCGGC CACCCCGGCG CCGAGCTGGA GGCTGTGATG GACGCCTGGC AGGCGGACCT CGCCCTGGTG GTGCCGGCCT ACCCGGCCAA CCGGCGGTTA GTGGTTGACG GCCACCTGTT GATAAGCGAG GGCATGGAGA CGGCCGCGGC TTCCGTAAGC CTTACTCCTG GCGATGCCAG GGCAGCCCTT TGCCACATCC CTACCGTCCT GCAGGGGGAG ATGGGCCGTC GGGTAGGCCA GATTAACCTG GCGACTGTGC GCCAGGGAGT GAAAGAACTG GTAGCTGCCC TGGAGGCCGC TCGTACAAAC AGCCAGGTGC TGGTCCTTGA TGCCGCCGAC GAAGAGGACC TAAGGAATAT CGCCCGGGCA ATCAGCCGCT TCCAGCGGGA TGTCATTGTG GCCGGCGCCG CCGGCATGGC CGCCCATTTA CCTCTGGCCT GGAACCTAAA ACCAGTGCCT AATAATCCAT TAAATAAAAA GGGGGCTATT CTCCTGGTTG CCGGCTCGCG TAACCCGGTC ACTGCCGCCC AGGTGCAACG CCTGGCTGAG GTTAGCGCGT GTCAGGCTGT AAAGGTAGAG ACGGAAGCTA TACTTACCGG AGAACCGGCT GTTGAAATAG AAAGGGTGTT GCAGGAAGTT ACAACTCAAG ATGCAGGCGC AGGTTTAATT ATTATAGCCG TAGATAGCCT TTTCCAGACA ATTGACAGAG ATAGGGTTTC CAACTCAGGA AGCAAAGCTA TAGCTTTAGC CCTTGGCACT ATCACCAGCC GCCTCTTAAA TATGCGAAGG ATAAGTGCCC TGGTAGTTAC TGGCGGAGAT ACTGCCGTTC ACGTTTGCCG GGCTCTGGAA GCCAGAGGAA TTAACCTGGC GGCCGATCTG TTGCCGGGTA TCCCTTTGGG GTACCTGGAA GGGGGGCGGG GTGATGGACT ACCAATCGTT ACTAAAGCCG GCGGTTTTGG TTCCCCCGAT TCCCTGATCA AAGTAAATGA ATTTCTTCAA CAGAGAATGA AAAGTGAAAT GGAGTTGGTA TGA
|
Protein sequence | MEQISIIADD LTGANDTGVQ FCQHGFRTMV IIDAANVERV GQDKDVWAIN TDTRHLAAPE AYQRVYEITL KLKKAAISRV YKKIDSTLRG HPGAELEAVM DAWQADLALV VPAYPANRRL VVDGHLLISE GMETAAASVS LTPGDARAAL CHIPTVLQGE MGRRVGQINL ATVRQGVKEL VAALEAARTN SQVLVLDAAD EEDLRNIARA ISRFQRDVIV AGAAGMAAHL PLAWNLKPVP NNPLNKKGAI LLVAGSRNPV TAAQVQRLAE VSACQAVKVE TEAILTGEPA VEIERVLQEV TTQDAGAGLI IIAVDSLFQT IDRDRVSNSG SKAIALALGT ITSRLLNMRR ISALVVTGGD TAVHVCRALE ARGINLAADL LPGIPLGYLE GGRGDGLPIV TKAGGFGSPD SLIKVNEFLQ QRMKSEMELV
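The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and sanity-checked follows. The helper names are hypothetical, and the start codon set is a simplification that covers only the most common bacterial start codons rather than everything translation table 11 permits.

```python
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}  # simplification, not the full table 11 set
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Check length is a multiple of 3 and the sequence has start and stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence field, used only as a short demonstration.
fragment = clean_sequence("ATGGAACAGA TATCCATCAT")
print(fragment, len(fragment), gc_fraction(fragment))
```

Applied to the full gene sequence field, `gc_fraction` could be compared with the reported GC content, and `looks_like_complete_cds` with the start and stop codons visible at the ends of the sequence.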