Gene Moth_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1506 
Symbol 
ID3831733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1550436 
End bp1551758 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content57% 
IMG OID637829438 
ProductSpoIVB peptidase 
Protein accessionYP_430358 
Protein GI83590349 
COG category 
COG ID 
TIGRFAM ID[TIGR02860] stage IV sporulation protein B 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTGA AACGCTTGGG GCACCGGGTC CTGGGCCTGG TGCTGGCGGC AATGTTACTC 
TACGGTGGCC TGGCTCCGCC GGTACGCAAT TTTTTTGCCC TCCCCTGGCA GCAGCGCCTG
CCGGCAGCAG CGCCAATTTC CCTGCCCTGG GAACTGCCGC CGGGTCTGGC CCGCCAGGTC
GAAGTGAAGG TTAATAGCGG GGACTGGAGC AGAGCCACTA CCGGCGATTT CCCCCGGTGG
TTACAGCTAC AATTAAAATT GTTCGGCTTT ATTCCATTAA AGAATATTAC CATTCAGCTG
GTACAACCGG TCGATGTTTC TCCCGGCGGC CAGGCTATAG GGGTCTTCTT GAAGACCGAA
GGCGTCCAGG TAGTAGGCCA GGCGGCGATT GTAGACGAGA GGGGTAACAA GGTTTACCTG
GCCCGGCAGG CAGGCCTGGA AACAGGCGAC GCCATCATCG CCATTGACGG TCAGAAGGTA
ACCAGCGACC AGGAAGTGGC CAACCTGATC AATGCCGCCG GGCAGGCCAA TCGCCAGGCC
AGGATCACCG TTAAAAGGGA AGGCCACTTG TTGACCCTGA ACATCCACCC TCGCTACTGC
CAGGAAACAG GGCGCTACCG GATTGGCGTC TATGTCCGGG ACAGTACGGC CGGCGTCGGC
ACCCTGACCT TTTACGATCA AAACAAGGGC GTTTTCGGTG CCCTGGGCCA CGTAGTTACC
GGCAGTGACG GCCAGACGGC CATGGATATC AGCGGCGGCA GGATAGTAGC GGCGGCCATC
CAGGGTATCC ACCAGGGCTA CCGGGGGCAA CCGGGAGAAA AGCTGGGCGT TTTTCTGGAA
AACGGCCAAT TTAGCGGTAC TATACAGAAG AATACTATTG TTGGCATATT TGGAACGATA
ACCGGCAAGC TTCCCGGTAA TAAAGAGATA CCAGTAGCCC TGGCCGATAC TGTCCACCCC
GGACCGGCAG AAATCCTGAC GGTTATCGAA GGAGAAAAGG TCGAAAGTTT CCAGGTGGAA
ATCGAACGGG TCATGCCCCA CCAGCGGGCC AGCGGCAAGG GCCTGGTCCT TAGGATTACC
GACCCCAGGT TACTGGCCGT AACCGGAGGC ATTATCCAGG GTATGAGCGG GAGTCCCATT
ATTCAAGACG GCCAACTGGC CGGTGCCGTA ACCCACGTCT TTATCAACGA CCCGACCCGG
GGTTACGGGG TGCTGGCGGA ATGGATGCTC CAGGAGACAG AACTTGTTCC TAAAGATAAA
GCCAGGGGTG CTACTGTCGA AACCCCTGGT TCTTTCCTTT TTGTGGTATT TTGTGTAGGA
TAA
 
Protein sequence
MPLKRLGHRV LGLVLAAMLL YGGLAPPVRN FFALPWQQRL PAAAPISLPW ELPPGLARQV 
EVKVNSGDWS RATTGDFPRW LQLQLKLFGF IPLKNITIQL VQPVDVSPGG QAIGVFLKTE
GVQVVGQAAI VDERGNKVYL ARQAGLETGD AIIAIDGQKV TSDQEVANLI NAAGQANRQA
RITVKREGHL LTLNIHPRYC QETGRYRIGV YVRDSTAGVG TLTFYDQNKG VFGALGHVVT
GSDGQTAMDI SGGRIVAAAI QGIHQGYRGQ PGEKLGVFLE NGQFSGTIQK NTIVGIFGTI
TGKLPGNKEI PVALADTVHP GPAEILTVIE GEKVESFQVE IERVMPHQRA SGKGLVLRIT
DPRLLAVTGG IIQGMSGSPI IQDGQLAGAV THVFINDPTR GYGVLAEWML QETELVPKDK
ARGATVETPG SFLFVVFCVG