Gene Moth_1319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1319 
Symbol 
ID3831029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1363770 
End bp1365248 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content56% 
IMG OID637829255 
Productstage IV sporulation protein A (spore cortex formation and coat assembly) 
Protein accessionYP_430175 
Protein GI83590166 
COG category 
COG ID 
TIGRFAM ID[TIGR02836] stage IV sporulation protein A 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.412877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAAAC GGGACATTTT CCGGGATATA ACTGAACGTA CCGGTGGCGA TATTTATCTG 
GGTGTCATGG GGCCGGTACG CAGCGGCAAA TCAACCTTCA TCACGCAGTT CATGGAAAAG
CTGGTACTAC CGAACATTAA AAACCCTAAT GATCGTGACC GGGCCAGGGA TGAATTGCCC
CAGAGCGGCG CCGGTCGCCT GATAATGACT ACGGAGCCCA AGTTTATCCC CAATGAAGCG
GTAGAAATCG CTATTCGCGA AGGGTTGAAG ATGCGGGTCC GGCTGGTTGA TTGCGTTGGC
TACACCGTAC CGGGGGCCCT GGGCTATGAG GATAATGAAG GTCCCCGGAT GGTGCGGACG
CCATGGTTTG AACACCCGGT ACCCTTTCAG GAGGCAGCCG AAACCGGGAC TCGCAAGGTC
ATTACCGACC ACGCTACTAT CGGCGTTGTC GTAACCACCG ACGGCAGTAT TACTGAAATA
CCGCGGCAGG ATTATGTTGA TGCCGAAGAA AGGGTTATCT GGGAACTCAA GGAACTGGGC
AAACCCTTTG TCATTCTGCT CAATTCGGTT CACCCCCTGG CGGAAGAGAC CATAGCCCTG
GCCGGTGAGC TGGAAACAAC CTACGACGTA CCGGTACTCC CGGTGGACTG CCTGAACTTA
ACGGAAGATG ACATCCTGCA TATCATGGAA GAAGCCCTTT ACGAATTCCC GGTGGCCGAG
GTCAATGTCA ACTTACCGCG CTGGGTGGAC GAACTAGAAA GCGAACACTG GCTGCGACAG
CAGCTGGAGA ATGCCGTCCG GGAGGCGGTG GGTGAAGTCC GGCGCCTGCG GGATATCAAC
AACGCCATTG AAAAGCTCGG GGAGAATGAA TATGTCTCCC GGGTCGCCTT GAAGGATATG
GACCTGGGAA CAGGCACGGC CCATATCGAT ATGGGCACCC GTGAGGGCCT TTTCCATCAG
ATTTTGCGGG AAATCAGCGG CCTGGACATC AGCGGTGATC AGGATATCGT CCGCTGGCTC
CGGGAACTGG CCGGCATCAA GAAAGAGTGG GATAAGATCG CCTACGGTAT CCAGGAGGTC
CGCAATACCG GTTATGGTGT AGTGACGCCC ACTGAAGATG AGATGGAACT GGCTGAACCA
GAGCTTATCA AACAGGGCGG CCGCTCCGGG GTACGGCTCA AGGCCACGGC GCCGTCCTAC
CACTTCATCC GCGCCGACAT CACCACCGAG GTGACGCCCA TCATCGGCAC CGAGAAGCAG
TGCGAGGATC TGGTTAAGTA TATCATGGAA GAGTTCGAGG ATAACCCCCA GAAGATATGG
CAGACCAACG TCTTTGGCAA ATCCTTGAGC GACCTGGTCC GGGAGGGCAT CCAGAGCAAG
CTCTACCGCA TGCCGGAAAA CGCCCAGGTC AAACTCCAGG AGACGGTGGA GCGCATAGTC
AATGACGGTG GCGGCGGGCT GATCTGCATT ATAATTTAG
 
Protein sequence
MEKRDIFRDI TERTGGDIYL GVMGPVRSGK STFITQFMEK LVLPNIKNPN DRDRARDELP 
QSGAGRLIMT TEPKFIPNEA VEIAIREGLK MRVRLVDCVG YTVPGALGYE DNEGPRMVRT
PWFEHPVPFQ EAAETGTRKV ITDHATIGVV VTTDGSITEI PRQDYVDAEE RVIWELKELG
KPFVILLNSV HPLAEETIAL AGELETTYDV PVLPVDCLNL TEDDILHIME EALYEFPVAE
VNVNLPRWVD ELESEHWLRQ QLENAVREAV GEVRRLRDIN NAIEKLGENE YVSRVALKDM
DLGTGTAHID MGTREGLFHQ ILREISGLDI SGDQDIVRWL RELAGIKKEW DKIAYGIQEV
RNTGYGVVTP TEDEMELAEP ELIKQGGRSG VRLKATAPSY HFIRADITTE VTPIIGTEKQ
CEDLVKYIME EFEDNPQKIW QTNVFGKSLS DLVREGIQSK LYRMPENAQV KLQETVERIV
NDGGGGLICI II