Gene Moth_0242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0242 
Symbol 
ID3832570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp243212 
End bp244822 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content61% 
IMG OID637828178 
ProductIg-like, group 2 
Protein accessionYP_429120 
Protein GI83589111 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAGTA GAATGTTCGC TTTGAAAAAG CCGGCGGGAG CCTTCGGGCT CCTGCTGGCC 
TTGATTTTGA TATTCGGATT ACCTGGTTAT GCCCTGGCCG TGCCTGCGCA GCCGCACCAG
TTCTACGGTC AGGTGACTGT AGGAGGAGAG CCAGCGCCGC AGGGCATTGA AGTAGTGGCC
AAAATTGATG GCGTCCAGTA CGCGGCCACT GTTACCGACG CCCTGGGCCG GTACGGCTAC
GATCCTGTGT TCTACGTTCC GGCTGACAAT CCCGATACCC CCGCCATAGA GGGAGGGCGC
AGGGGGGATC TCATTGAATT CTACGTGGCT GGTGTCCTGG CCGGAAGCTA TCAATTTGAA
GTTGGCGGCG TGACGAAGCT TGACCTCTCC ATAGCAGCAC TGCCGGACAC CACCGCGCCG
ACTGTGAGCA TAACTGACCC GGCGAACAAC GCGACCAGCG TTGCGGTTGA CAAGACCATC
ACGGTGACCT TCAGCGAGGA TGTGAAGGCT GGTGCTGCCT ACGACGGCAT TACCCTCAAA
GACGCTGGAG GTTACCCCGT TGCCGTGACC AGGAACATTG TCGGCAAAGT CCTGACCATT
AAGCCGAACG CTAACCTTAC CAACAGCACC ACTTACACCG TGGTCATTCC AGCCGGAGCG
GTGGCTGACC TGGCGGGCAA AGCCCTGGCG CAGGATTACA TCTTCAACTT CACAACCGAA
GCGGCGCCCG TGACGCTTCA GGCACTGAAG GTCGAGCCGT CCAGCTTCAC CCTTACCGTG
GGCGAGACCA AGCAGCTCGT GGTGAAGGCG GTTTACTCCG ACGGCAGCGA GGCGGACGTG
ACGGATGAGG CGACCTACGC TTCGGCGAAT GAGAACGTGG CCAGGATAAG TGCCACAGGC
TTGATCACCG CGGCGAGCGC AGGCGAGACG GTGATAACGG CCACCTACGG CGATAAAGAG
GCCCAGGTTG CAGTGATTGT ACAGGTGTCT CCTGCAGGTC CCGCGCCCAC AGTTACGGCT
GTCGATCCGA CTAATGCCGT GGCGGGCCAG AGCGGACAGA CCATCACCGT CACCGGCCAG
AACTTCCAGG ACGGTGCCCA GGTGGTCCTG CTGCAGAACG GGCAGGAAGT ATCTGCTGTA
AACGCGGTCT ACAGCGATTC CAGCCGGGTA ACATTTACCC TCCCCGTCAG CGTGCCGCCA
GGTGTTTACA CCGTCGCCGT ACGCAACCCC GACGGCCAGC AGTCCGCTGA CGCCGTAGCC
CTGGTCCTTT ACGCCGGCCC GAAACCGGCG ATGAACGTCT ATCCCGGCGT CCAGGGCACC
CAGCCGGGCC AGGTGCAGGC CGGCGGCGGC GTGCGGGTAG AGGTCCCCGT CCTGGTAAGT
CAGATCTATC CAAATGCGCT GGTGATCATC AGGGTAGACG ACCCTGACGG AAAGCCGCTT
TACGCTTCGG TAGAGGGTCC GCTACCGGCC AACACCCCGC TGAAATACTC CGCCAGCTTC
AACCTGCCGG GCAAGGCCGG CAACTACACC GTGAAAGCCT ATGTCTGGGA CGGCTGGCAG
ACCATGAACC CCATCGTGCC GGCGTCGCAA GCGCAATTCA CAGCCCAGTA A
 
Protein sequence
MFSRMFALKK PAGAFGLLLA LILIFGLPGY ALAVPAQPHQ FYGQVTVGGE PAPQGIEVVA 
KIDGVQYAAT VTDALGRYGY DPVFYVPADN PDTPAIEGGR RGDLIEFYVA GVLAGSYQFE
VGGVTKLDLS IAALPDTTAP TVSITDPANN ATSVAVDKTI TVTFSEDVKA GAAYDGITLK
DAGGYPVAVT RNIVGKVLTI KPNANLTNST TYTVVIPAGA VADLAGKALA QDYIFNFTTE
AAPVTLQALK VEPSSFTLTV GETKQLVVKA VYSDGSEADV TDEATYASAN ENVARISATG
LITAASAGET VITATYGDKE AQVAVIVQVS PAGPAPTVTA VDPTNAVAGQ SGQTITVTGQ
NFQDGAQVVL LQNGQEVSAV NAVYSDSSRV TFTLPVSVPP GVYTVAVRNP DGQQSADAVA
LVLYAGPKPA MNVYPGVQGT QPGQVQAGGG VRVEVPVLVS QIYPNALVII RVDDPDGKPL
YASVEGPLPA NTPLKYSASF NLPGKAGNYT VKAYVWDGWQ TMNPIVPASQ AQFTAQ