Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1319 |
Symbol | |
ID | 3831029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1363770 |
End bp | 1365248 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637829255 |
Product | stage IV sporulation protein A (spore cortex formation and coat assembly) |
Protein accession | YP_430175 |
Protein GI | 83590166 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02836] stage IV sporulation protein A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.412877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAAAAC GGGACATTTT CCGGGATATA ACTGAACGTA CCGGTGGCGA TATTTATCTG GGTGTCATGG GGCCGGTACG CAGCGGCAAA TCAACCTTCA TCACGCAGTT CATGGAAAAG CTGGTACTAC CGAACATTAA AAACCCTAAT GATCGTGACC GGGCCAGGGA TGAATTGCCC CAGAGCGGCG CCGGTCGCCT GATAATGACT ACGGAGCCCA AGTTTATCCC CAATGAAGCG GTAGAAATCG CTATTCGCGA AGGGTTGAAG ATGCGGGTCC GGCTGGTTGA TTGCGTTGGC TACACCGTAC CGGGGGCCCT GGGCTATGAG GATAATGAAG GTCCCCGGAT GGTGCGGACG CCATGGTTTG AACACCCGGT ACCCTTTCAG GAGGCAGCCG AAACCGGGAC TCGCAAGGTC ATTACCGACC ACGCTACTAT CGGCGTTGTC GTAACCACCG ACGGCAGTAT TACTGAAATA CCGCGGCAGG ATTATGTTGA TGCCGAAGAA AGGGTTATCT GGGAACTCAA GGAACTGGGC AAACCCTTTG TCATTCTGCT CAATTCGGTT CACCCCCTGG CGGAAGAGAC CATAGCCCTG GCCGGTGAGC TGGAAACAAC CTACGACGTA CCGGTACTCC CGGTGGACTG CCTGAACTTA ACGGAAGATG ACATCCTGCA TATCATGGAA GAAGCCCTTT ACGAATTCCC GGTGGCCGAG GTCAATGTCA ACTTACCGCG CTGGGTGGAC GAACTAGAAA GCGAACACTG GCTGCGACAG CAGCTGGAGA ATGCCGTCCG GGAGGCGGTG GGTGAAGTCC GGCGCCTGCG GGATATCAAC AACGCCATTG AAAAGCTCGG GGAGAATGAA TATGTCTCCC GGGTCGCCTT GAAGGATATG GACCTGGGAA CAGGCACGGC CCATATCGAT ATGGGCACCC GTGAGGGCCT TTTCCATCAG ATTTTGCGGG AAATCAGCGG CCTGGACATC AGCGGTGATC AGGATATCGT CCGCTGGCTC CGGGAACTGG CCGGCATCAA GAAAGAGTGG GATAAGATCG CCTACGGTAT CCAGGAGGTC CGCAATACCG GTTATGGTGT AGTGACGCCC ACTGAAGATG AGATGGAACT GGCTGAACCA GAGCTTATCA AACAGGGCGG CCGCTCCGGG GTACGGCTCA AGGCCACGGC GCCGTCCTAC CACTTCATCC GCGCCGACAT CACCACCGAG GTGACGCCCA TCATCGGCAC CGAGAAGCAG TGCGAGGATC TGGTTAAGTA TATCATGGAA GAGTTCGAGG ATAACCCCCA GAAGATATGG CAGACCAACG TCTTTGGCAA ATCCTTGAGC GACCTGGTCC GGGAGGGCAT CCAGAGCAAG CTCTACCGCA TGCCGGAAAA CGCCCAGGTC AAACTCCAGG AGACGGTGGA GCGCATAGTC AATGACGGTG GCGGCGGGCT GATCTGCATT ATAATTTAG
|
Protein sequence | MEKRDIFRDI TERTGGDIYL GVMGPVRSGK STFITQFMEK LVLPNIKNPN DRDRARDELP QSGAGRLIMT TEPKFIPNEA VEIAIREGLK MRVRLVDCVG YTVPGALGYE DNEGPRMVRT PWFEHPVPFQ EAAETGTRKV ITDHATIGVV VTTDGSITEI PRQDYVDAEE RVIWELKELG KPFVILLNSV HPLAEETIAL AGELETTYDV PVLPVDCLNL TEDDILHIME EALYEFPVAE VNVNLPRWVD ELESEHWLRQ QLENAVREAV GEVRRLRDIN NAIEKLGENE YVSRVALKDM DLGTGTAHID MGTREGLFHQ ILREISGLDI SGDQDIVRWL RELAGIKKEW DKIAYGIQEV RNTGYGVVTP TEDEMELAEP ELIKQGGRSG VRLKATAPSY HFIRADITTE VTPIIGTEKQ CEDLVKYIME EFEDNPQKIW QTNVFGKSLS DLVREGIQSK LYRMPENAQV KLQETVERIV NDGGGGLICI II
|
| |