Gene Moth_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1901 
Symbol 
ID3831174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1965900 
End bp1966871 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content58% 
IMG OID637829834 
ProductABC transporter related 
Protein accessionYP_430744 
Protein GI83590735 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.298526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAATG ACAATCCTGC CATGATCATT GCAGCGCGCG GTCTGGTGAA AAGCTTCGGA 
CCGATCCGGG CCGTGGATCA CATTGATTTA CAGGTGGAGA AGGGGGAAAT TTTCGGCCTG
GTAGGACCGG ACGGGGCCGG GAAGACAACG ACCATGCGCA TGCTGGCAAC TATCCTCCCG
GCTGATGCCG GAGCAATATC CGTTCTGGGT TATGACGGCC GGACGGAAGC TGAGCGCATT
AAGGAGCACA TTGGTTATAT GCCCCAGCGG TTCAGCCTGT ACGGGGATCT AACAGTGGCG
GAAAACCTGG AATTCTACGC TGAAATCTAT GAAGTTCCCC GAAAGGTGCG GGAGCAGCGC
AAAAAAGATC TCCTGGCGTG GGCCAACCTT ACCCGGCATA GCTATAAACA GGCCGATCAG
TTGTCGGGGG GAATGAAACA AAAACTGGCC CTGGCGTGTA ACCTGATCCA CGAACCCGCC
GTTCTTTTCC TGGACGAACC CAGCACGGGA GTAGACCCGG TGGCGCGGCG CGATTTCTGG
CGCATCCTCT TCAGGTTGCG CGAGGAGGGG GCGACGATCA TGGTCAGCAC GCCTTATATG
GATGAGGCCG AGCGTTGCGA CCGCATCGCC TTTACTTATA ACGGCCGCAT CCTGACTTGC
GGTACCCCGG CGGCAGTTAA GAATTTATTC CGGGGCCAGC TCCTGCTCTT GCGTGCGGAG
ACTATTGCCA TGCTCCACGC CGCCAGGGAC TACCTCCGCC GGGAACAATT GCTGGCTGAT
GTCTTGATTT ATGGCGACGC TTTGCACCTG GTGACCGACG ATGCCCTGGA AACGGCAAGG
CTTCTACCGG GGCTCCTGGA ACGCCAGGGT ATCCGGGTTA CCCATCTCCA GCCTATTCCT
CCTTCTCTGG AAGATACCTT TGCTTACCTG GTCAGACAGG CAGGAGGATT CGCGGGGAGG
GAGTCCGCTT GA
 
Protein sequence
MVNDNPAMII AARGLVKSFG PIRAVDHIDL QVEKGEIFGL VGPDGAGKTT TMRMLATILP 
ADAGAISVLG YDGRTEAERI KEHIGYMPQR FSLYGDLTVA ENLEFYAEIY EVPRKVREQR
KKDLLAWANL TRHSYKQADQ LSGGMKQKLA LACNLIHEPA VLFLDEPSTG VDPVARRDFW
RILFRLREEG ATIMVSTPYM DEAERCDRIA FTYNGRILTC GTPAAVKNLF RGQLLLLRAE
TIAMLHAARD YLRREQLLAD VLIYGDALHL VTDDALETAR LLPGLLERQG IRVTHLQPIP
PSLEDTFAYL VRQAGGFAGR ESA