Gene Moth_0737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0737 
Symbol 
ID3831129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp769688 
End bp771208 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content61% 
IMG OID637828668 
Productamino acid permease-associated region 
Protein accessionYP_429598 
Protein GI83589589 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID[TIGR00909] amino acid transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000351708 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCT GGCGAACCAA AAAAATAGCT GACCTGATGC AGGAAGCTGG AGACAATCAA 
AAGCTCAAGC GGAGCATTGG CACGCCGGAA CTAGTCGCCC TGGGTGTAGG GGCAATTATC
GGCTCCGGGA TTTTTGTCCT GACGGGAGTG GCTGCCGCCA ACTACGCCGG CCCGGCCCTG
GTTTTCTCCT TTATCCTCTC CGGCCTGGCG GCAGGCCTGG CGGCCCTCGT TTACGCTGAA
ATGGCCGCCA TGATCCCGGT GACCGGCAGC GCCTATACCT ATGCTTACGC TTCCCTGGGC
GAGATTATCG CCTGGCTCGT GGGCTGGAAC CTGGTACTGG AATACCTGGT GGCCTCGGGG
GCCGTGGCTG TAGGCTGGAG CGGTTACATC ACCGATATGC TGGCGTCTGT GGGGGTCTTT
CTGCCCCGGG CCCTGGTCAA TTCTCCCTTA AGCGGCGGCC TGGTTAACCT GCCGGCTATC
TTGATCACCG TCGTTATGAC CGGAGTGGCC ATTGTCGGCA CCACCACCAG TGCCCGGACT
AATAAGATTA TCGTCGGGGT AAAAATCCTG GTGATCCTGG CCTTCCTGGC CTTAGGCGCC
CCGCGGGTAA ACCCGGCCTA CTGGCACCCC TTCCTCCCCT TCGGCGTCAC CGGTGTCGTC
CACGGGGCGG CTATTATCTT TTTCGCCTAC ATCGGCTTCG ATGCCGTGGC TACTGCCGCC
GAGGAGGTGC GCGACCCGGC ACGGGAATTA CCCCTGGGAA TCATCGGCTC CCTGGCCCTC
GCCACTATAT TATATGTGGC CGTTACCATT GTTCTGACTG GTTTGACGCC CTACACCAAC
TTGAACACCC CTTCCCCGGT GACCACCGGC CTGCTGGCTG CCGGCGTCCG TGGAGCTTCC
CTTATTGTGG GCACCGGCGC CCTGGCCGGT TTGACAAGCG TCCTGCTGGT AAACATCTTT
GCCCAGAGCC GGGTCTTTAT GGCCATGGGC CGCGACGGCC TGCTGCCCCC TCTCTTTACC
AGGGTCCACC CTCGCTTCCA TACCCCCTGG CTGACAACAT TAATCGTCGG CGCCTTTATC
ACCCTCATCG GCGGCTTCCT GCCGGTGGAT ATCATTGCCG AGCTGGCCAA TGTAGGTACC
TTGTCTGCCT TTTTTGTGGT TTCCGTGGGC GTCATGGTCC TGAGGCGCAC CCAGCCGGAC
CTGAAACGGC CCTTCAAAGT GCCCCTGATG CCCTGGACGC CCCTCCTGGC CATAGCATTT
GCCGTTTACC TTTTCTTCAA CCTGCCAGGT CTAACCTGGA TTCGTTTCGG CGTCTGGGTA
ACCCTGGGAC TGGTGGTTTA TTTCGCCTAC GGCCGTCGCC ATAGCGTCCT GGCCCGGGAG
GAGGAATCTA AAGCCGTCCC CCGGCCGACC TACCGGCCCT CACCCCTGGA GATGCCCGCA
CCGGCTCGAA AGCCTTTCCC CTTCAAGCTA CCGGCCCCGG ACCTCTTGCG TATGTTCCTT
CCCCGGTGGC GGCGGGATTA A
 
Protein sequence
MGIWRTKKIA DLMQEAGDNQ KLKRSIGTPE LVALGVGAII GSGIFVLTGV AAANYAGPAL 
VFSFILSGLA AGLAALVYAE MAAMIPVTGS AYTYAYASLG EIIAWLVGWN LVLEYLVASG
AVAVGWSGYI TDMLASVGVF LPRALVNSPL SGGLVNLPAI LITVVMTGVA IVGTTTSART
NKIIVGVKIL VILAFLALGA PRVNPAYWHP FLPFGVTGVV HGAAIIFFAY IGFDAVATAA
EEVRDPAREL PLGIIGSLAL ATILYVAVTI VLTGLTPYTN LNTPSPVTTG LLAAGVRGAS
LIVGTGALAG LTSVLLVNIF AQSRVFMAMG RDGLLPPLFT RVHPRFHTPW LTTLIVGAFI
TLIGGFLPVD IIAELANVGT LSAFFVVSVG VMVLRRTQPD LKRPFKVPLM PWTPLLAIAF
AVYLFFNLPG LTWIRFGVWV TLGLVVYFAY GRRHSVLARE EESKAVPRPT YRPSPLEMPA
PARKPFPFKL PAPDLLRMFL PRWRRD