Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1040 |
Symbol | |
ID | 3831846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1067225 |
End bp | 1068325 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637828968 |
Product | hypothetical protein |
Protein accession | YP_429897 |
Protein GI | 83589888 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | [TIGR02872] sporulation integral membrane protein YtvI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00428171 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000129007 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | TTGGCCGGCC TGAACGGGCG CTTTCAACAG ACTTTCCAGT CTCTATTACT AGCCTTAATG GCTGCCGTCC TGTTCCTGTT GCTTTATTAT TACATATTTC CTGCGGCCAG GGAGATTATC AAGACCCTGG TCCCTATCGT CTTGCCCTTT GCCCTGGCAG CTTTACTTGC CGCCATTATC GATCCTGCAG TCAACCTGCT TGAAAAAAAG TTAAAAATAG GACGTGGCTG GGCTGTCATC ACCACTCTGC TCCTGGTACT GGCCATTATG GGCGTAGCCC TGTTTTACCT GCTCGCCAAT CTAATTATCG AGCTGGAAAG CCTGGTCCTG AACCTGCCAG CCCAGGCCCG CAGCCTGGGA GCGCTTCTCC AGGAGTATTT TTACCGCCTG CAGGGCTTTT ATTTCGGCGG CAACCTGCCG CCGAACATCT TGATTTCCTT TCAATCTTTA TTCAATAATG CCGTTAACGT TTTAAAAGGT TTCCTTACCC AAACAGTTCA GGGACTGGTT ATTATTGTCA GCTCCCTGCC CGACTTCTTT ATTTTTGTTA TCATTACCCT GGTGGCCACC TATTTCTTCA GCCGGGATAA GGAACTAATC CTGCGGACCT TGCTCCGGGT TATGCCGGCC GGGTGGCGGG AACGGACGAG CCGGGTTTTC AGTTCCCTGG GCCAGGCGAT TATCGGTTAC CTGCGGGCAG AGATCCTGTT AATCAGCCTG CAGATGACCC AGAGCGTCCT CGGTCTCCTG ATTTTAAAGG TGGACTACGC CCTGACCCTG GCCTTTTTGA TCGGCCTGGC CGACTTACTC CCCATTGTAG GGCCGGGTAC GGTCTTTATC CCCTGGATCA TTATTGAATT TATCCTCGGC CACTACGGCC TGGGGCTGGC CCTGCTGATT CTCTACGCCT TTATTATCAT CCTGCGCCAG GTACTCCAGC CCAAGCTGGT GGCTGTCAAC CTGGGCCTGT ACCCTTTAAC CACTTTGATT GTCCTTTATG CCGGCTTAAA GCTCCTGGGA GTAGTGGGCC TGGCCTTGGG GCCTCTGACC ATTGTTGTTT TAAAGGCCTT TTTCCGTTCC GGACAGGAGG TTAATAAGTA A
|
Protein sequence | MAGLNGRFQQ TFQSLLLALM AAVLFLLLYY YIFPAAREII KTLVPIVLPF ALAALLAAII DPAVNLLEKK LKIGRGWAVI TTLLLVLAIM GVALFYLLAN LIIELESLVL NLPAQARSLG ALLQEYFYRL QGFYFGGNLP PNILISFQSL FNNAVNVLKG FLTQTVQGLV IIVSSLPDFF IFVIITLVAT YFFSRDKELI LRTLLRVMPA GWRERTSRVF SSLGQAIIGY LRAEILLISL QMTQSVLGLL ILKVDYALTL AFLIGLADLL PIVGPGTVFI PWIIIEFILG HYGLGLALLI LYAFIIILRQ VLQPKLVAVN LGLYPLTTLI VLYAGLKLLG VVGLALGPLT IVVLKAFFRS GQEVNK
|
| |