Gene Moth_1951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1951 
Symbol 
ID3832301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2027649 
End bp2028770 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content61% 
IMG OID637829882 
Productinner-membrane translocator 
Protein accessionYP_430792 
Protein GI83590783 
COG category[R] General function prediction only 
COG ID[COG4603] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00563372 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAGCTA CAGCCGTACC CAAGAATACC AACCGGAGTT CCGGGCCGGG ATCTGGACCC 
GCGTTAACCT TAGAAAAGCG CCTGGAACCC TCGCGCTTTA TGGCCGTGGT AGTACCCGTT
ATATCCGTCA TCCTGGCCCT GGCCGTCGGG GCCATCTTCC TGGCGGCCAC CGGCTTTCAA
CCAATGAAGG TCTACCAGAG CATGCTCAAC GGTGCCGTCG GTTCCAAGTA CGGTATCTCG
GAAACCATCG TCAAGGCTAT CCCCCTGATG CTGGCGGGCC TGGGGGTTTC GGTGGCCTTC
CGCATGCTCC TCTGGAACAT CGGCGCTGAA GGCCAGTTCT ATATGGGCGC CTTTGGCGCC
AGTTGGGTGG CCCTGACTTT TCCCCATTTA CCGGCTTACA TTATGCTGCC GGCCATGTTC
CTTGCCGGGG GCTTGATGGG GGCCCTGTGG GGATTGCTGC CGGCCTTGCC CCGGGCCAAA
TGGGGCGTCA ACGAGGTCAT TACCACCCTG ATGCTCAACT ATGTAGCCAT CCTCTGGGTG
GACTACTTGG TTTACGGTCC CTGGAAGGAC CCCAAGGGTT TTAACTTTCC CCTCACGGCC
ACCTTCAGCG ATGCCGCAGC GCTACCTACC ATTGCCGGCA CCAGGGTGCA CGTGGGATTG
ATCTTTGCCC TGGTGGCGGC CGTGCTCCTC GCCATTATCC TCTGGCACAC CAGGTGGGGT
TATGAGATCC GGGTCATCGG CGAGAGCGCC CGGGCCGCCC GTTACGCCGG CATGAATATC
GAACGCAATA TTATCCTAGT TATGCTCCTT AGCGGCGCCC TGGCCGGGCT GGCCGGCATG
AGCGAGGTGG CCGGCATCAC CCACCGCCTC CAGCACGGCA TCTCCCCGGG ATACGGCTAT
ACCGCCATTA TCATCGCCTG GCTGGCCAAG CTGCACCCGG CGACCATCAT CCTGGTTTCT
ATCCTCTTCG GCGGTCTCAT TGTCGGCGGG TACAGCGTCC AGACTTCCGG GGTACCGGCG
GCCACGGTAT CAATGCTCCA GGGGGCCATC CTCTTCTTTG TCCTTGGCGG TGAGATCCTG
ACCCGTTACC GGTTGCACTT CGGTCGTAAG GAGGGAAAAT AA
 
Protein sequence
MPATAVPKNT NRSSGPGSGP ALTLEKRLEP SRFMAVVVPV ISVILALAVG AIFLAATGFQ 
PMKVYQSMLN GAVGSKYGIS ETIVKAIPLM LAGLGVSVAF RMLLWNIGAE GQFYMGAFGA
SWVALTFPHL PAYIMLPAMF LAGGLMGALW GLLPALPRAK WGVNEVITTL MLNYVAILWV
DYLVYGPWKD PKGFNFPLTA TFSDAAALPT IAGTRVHVGL IFALVAAVLL AIILWHTRWG
YEIRVIGESA RAARYAGMNI ERNIILVMLL SGALAGLAGM SEVAGITHRL QHGISPGYGY
TAIIIAWLAK LHPATIILVS ILFGGLIVGG YSVQTSGVPA ATVSMLQGAI LFFVLGGEIL
TRYRLHFGRK EGK