Gene Moth_1583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1583 
Symbol 
ID3832088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1617514 
End bp1618857 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content57% 
IMG OID637829512 
ProductVWA containing CoxE-like 
Protein accessionYP_430432 
Protein GI83590423 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000025338 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.503012 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACAA TTCCGGAAGT CCTGCAGGAG ATAGCTACCT TATTGCGCGC CGGGGGTATA 
GAGGTGACCC TGGCGGAAAT GGTGGATGCT GTACGGGGCT GGCAGACCAC CCCGGGGGCT
TACTGGCCGG AGGTTTTGCA GGTAACCCTG GTCAAGAGAT ACGAGGATTT GAAGCCCCTG
GCGGCCTTGC TCTCCCTCAC CGGCAGCCAT GGCTGGCCGG AACAGCCTGC CTGTAAGAAC
AGGGGTATGG CTTCCGGGCA GGCTACGCGG CACGCAGGCG GGCAGTTGCC GGTCCAGGAC
GTAATGGCAG CTCTTTTTAA TGGTTCTGAT AAAGACCTCA ACGACCTGGC GGAGCAGGCC
ATCGACCTGC TGGGAGAATT GAATGCCGAA GCTCTGGACC ACCTGGAGGG TAAGGTAAGG
GAAGCCAGGT TGGCTTTAAA CTGGCATATG GTCCGCCACA GGTTAAGGGT TATGGAAAGC
GAGGGGCGGC AGGAGGCCCG GGAGGCCCAA CAGAGGTTGC AGCTATTGGC AGCGATTATT
CGCCGGAACC TGGAGTTGCG CCTGGTGCAA GAGTTTGGTC CTGAAGCCAT GCTGGCTATT
CTCCGTACCT ACAATCTGGC CGAAAAAGGA TGGAGCGAGC TGGCTGAAGG GGATCTGGCC
GTTATGCGAC CGTATTTAAA GAAACTGGGT CGCTATCTGG GAAATAAGTA TTCCTGGCGT
TACCGGCCCG CTCACCGGGG GAAGATCGAC CTGCGCCGCA CTGTTAAAGA AGCCTGCCGG
CACGGCGGCG TTCCCTGGCA ACTCCGCTAC CGCGACCGTC GCCGGGAGAG GCCGGTCCTT
TTTGTCCTGG GCGACATTTC AGGTTCGGTA GCGCCCTTTA GCGTCTTTAT GCTGGAATTG
ATCTACGCCA TGCAGCATGC CTTTCGCCAG GTCCGGACCT TTGTTTTTGT TGATGACCTG
GCTGAAGTTA CCAACGCCAT CAGGGAATCG CAGGACGCCG GGGCCATGGA GCAAGTGGCC
CGTTTCGCCC GCTGCTCGGT TAGCGGTTAC TCCGACTTCG GGCGTGTCTT CAAGCTGTTT
CTGGAACGCT ATGGAGAGGT TTTAACCCCT GAGACTACAA TTTTAATCCT GGGCGACGCG
CGCAATAACT GGCGGCAACC GGAGGTCGAT AGTTTTGCAG CTATTTGCCG TAAAGCAGGA
AAGGTAGTAT GGCTGAACCC GCAACCGGAA GCCTCCTGGA ATACCATGGA TAGCTCCATG
GCTCTTTATG CCCCCTTCTG CCACGCTGTC AGGGAATGCT CGAATCTAAA GCAGCTAATT
GCCATTGCCA GGGAGGGGCT TTAA
 
Protein sequence
MDTIPEVLQE IATLLRAGGI EVTLAEMVDA VRGWQTTPGA YWPEVLQVTL VKRYEDLKPL 
AALLSLTGSH GWPEQPACKN RGMASGQATR HAGGQLPVQD VMAALFNGSD KDLNDLAEQA
IDLLGELNAE ALDHLEGKVR EARLALNWHM VRHRLRVMES EGRQEAREAQ QRLQLLAAII
RRNLELRLVQ EFGPEAMLAI LRTYNLAEKG WSELAEGDLA VMRPYLKKLG RYLGNKYSWR
YRPAHRGKID LRRTVKEACR HGGVPWQLRY RDRRRERPVL FVLGDISGSV APFSVFMLEL
IYAMQHAFRQ VRTFVFVDDL AEVTNAIRES QDAGAMEQVA RFARCSVSGY SDFGRVFKLF
LERYGEVLTP ETTILILGDA RNNWRQPEVD SFAAICRKAG KVVWLNPQPE ASWNTMDSSM
ALYAPFCHAV RECSNLKQLI AIAREGL