Gene Moth_1228 details


Gene Information

Locus tag: Moth_1228
Symbol:
ID: 3833169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1266096
End bp: 1267775
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 53%
IMG OID: 637829163
Product: copper amine oxidase-like
Protein accession: YP_430085
Protein GI: 83590076
COG category:
COG ID:
TIGRFAM ID:
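
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1266096, 1267775)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

Both values agree with the Gene Length and Protein Length fields of this record.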


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000175473
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGTA TAAACACCCG CTTGCTACCT GTAACAATAA TTGCCCTCCT GCTTTGCATG 
GCTACATTTT TAAATACGGC CCTTGCCGCC TCGCCGCCGG CCAAGCAAAT AATTTTAACC
CCGGGATCAA CGGAGATCTG GGTTGACGGG GAAAAAGCAA CCCTGCCAGC GGCACCCTAT
GTTTCTGACG GCGTTCTCAT GGTCCCCCTG AGGAGGCTGG CTGACGAACT AGGCTTTACG
GTCCAATGGC AAGAGGGACC ACCCCAGTCT ATTGTCGTAA ATTCCGGCAA CCTGCGGGCG
GAAATGTACC CGGGCACCTG GGTAGTCTTT CTTACGGGTT CTGACTACCG GGCCGTAATC
CTTCCGGCCG AGGTCCAGCA AAAAGATGGC CTCATCTTTG TTCCCACCGC TTTTTTCCAA
GATGCCTTCC GGGTACCGAT GGCAGAAAGT AAAGGAGAAA AGGGAGTTTA CCTTTTGGGA
AGCGATAATC AGCCGCCGAC TGCTTATTTT GACGTCCAGG AGCCGGTGTA TGCCGGGGAA
GAAGTGAAGT ACATAGACAA AAGTAGCGAC GGTGACGGCG ATGCAATTGT CGAACGCCAG
TGGTTAAACA AGAAGAATAT CTTCCCATCG CCCGGGGTTT ATTCTGTTAC CCTGAAAGTG
AAGGACAGCC GTGGTAGCTG GAGCAAACCC TATGTGCGGG AAATAAAGGT CCTGCCGCGG
CCGGCGACTG ATGTTCCCCG GCCGGGAGAA ATAGTAGAAA ATATCATGGG CCAGGCCGAA
AATACCTTGA AGCCCGTAAA GGCGGATAGT GGTCCCCGGT TGCTTTTTAG TGACGACCCG
GAATACATAG AGAAGCCGGG TATCCTCTAC CGGGATAAAT TAAAAGGGGA AGGCAGGCTC
TATTTCTGGC ATGACGTTAA CTCCCCGGGC TCATTGAAAG TGTATGTCCT GGCTATAAAT
ACCAGCCCCA GGGAAGCAGA AGTCAGTATC CTTAAGGAAG GTTACGGTGG GCCTTCGAAT
AACGTATACC TTGTCGCCAG GACGGCCTTT ACGGCTTACT ACCATTCCCA GGGGCAGCGA
AGGTATACCC TCAAACCAGG GCAAATTTTA GTGCTCAATC CCGGTGCCCC GGCAGCGGTA
CGCTATCAAG TGGTCCACGG CATTATCGAC CTGAAAACCA GTGAGGAAAT TACGGTGGCC
TTTGTGGCCG TCCCGGCTAC GGTAAATGTC TTGGAAGCCT ACAGCCGGCT GGGAGTGCTT
CCCAGGGACG GTGTGCACGT CCGGGGTACG TTTGCCGCAG CCGACAGGGA AATGACCATC
GACCTCCGGG GGGCAAAGAC CGGTTCCATT TTACTGGCTG ACGGCAGCGA TGATAAGTAT
ATGGCCGGGG TGGATGGGAT TACCGGTTCG TCGGTCTGGA ACGCAGGTAA TTACGGCATG
CTCTACCGGC TAAAAATCAA ATCAGATAAA AAAACAGGAG TTTATTTAAT CCCTGCCGGG
GGCAGTTTTG GGGGCACCCT GATTTTTAAC GCCGGGGAGG TGTCGGTACC GTTAGAGGGC
TTTATTTCCT CACCAGCCCA GGCTGTTTAT CTTGGAACCA CGGTCCCCGA GGGCATCACC
GAGATGCTTT TTATGTCTCC CGGTGGTTCC TGCCTGCCGG TAAAGCTGTT GTTCAAGTGA
 
Protein sequence
MQRINTRLLP VTIIALLLCM ATFLNTALAA SPPAKQIILT PGSTEIWVDG EKATLPAAPY 
VSDGVLMVPL RRLADELGFT VQWQEGPPQS IVVNSGNLRA EMYPGTWVVF LTGSDYRAVI
LPAEVQQKDG LIFVPTAFFQ DAFRVPMAES KGEKGVYLLG SDNQPPTAYF DVQEPVYAGE
EVKYIDKSSD GDGDAIVERQ WLNKKNIFPS PGVYSVTLKV KDSRGSWSKP YVREIKVLPR
PATDVPRPGE IVENIMGQAE NTLKPVKADS GPRLLFSDDP EYIEKPGILY RDKLKGEGRL
YFWHDVNSPG SLKVYVLAIN TSPREAEVSI LKEGYGGPSN NVYLVARTAF TAYYHSQGQR
RYTLKPGQIL VLNPGAPAAV RYQVVHGIID LKTSEEITVA FVAVPATVNV LEAYSRLGVL
PRDGVHVRGT FAAADREMTI DLRGAKTGSI LLADGSDDKY MAGVDGITGS SVWNAGNYGM
LYRLKIKSDK KTGVYLIPAG GSFGGTLIFN AGEVSVPLEG FISSPAQAVY LGTTVPEGIT
EMLFMSPGGS CLPVKLLFK
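
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). All function names are hypothetical, and a dedicated library such as Biopython would be the more usual choice in practice.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two display blocks of the gene sequence above.
fragment = clean_sequence("ATGCAGCGTA TAAACACCCG")
print(translate(fragment))  # MQRINT
```

The output for the fragment matches the first six residues of the protein sequence listed in this record. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.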