Gene Moth_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1456 
Symbol 
ID3831342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1502035 
End bp1503696 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content57% 
IMG OID637829389 
ProductABC transporter related 
Protein accessionYP_430309 
Protein GI83590300 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR01166] cobalt transport protein ATP-binding subunit 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.539244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCTTAT TTCAAAGCGA AAATTTAATT TATTACTACC CGGATAGGGA AAAGCCGGCC 
TTGAAAGATA TCAATTTGCG TATTGAAGAA GGGGAGTTTT TATTGATAAC CGGCGGTTCG
GGATCGGGTA AGTCTACCTT AGCGCGGGTG CTGGCCGGCC TGATCCCGGA TTTTTACGGC
GGCCGCTTTG GTGGCAAGGT TTATTTTCAA GGGCGGGACA TGGGCCAGAT GAACCGGCGA
AAACTGGCCC GGGAAGTGGG GATGGTCTTC CAGGATCCGG AAAAACAACT GGTTATGACC
AGTGTCGAGG CCGAGATCGC CTTCGGCCTG GAAAACCTGG GTCTGCCCCG GGCAGAGATG
TCCCGGCGGG TTGCCGAGGT CTTGAGTTTT CTGGACCTGA CGGAAGTCAG GCAGGAATTT
ACCGCGCACC TTTCCGGTGG GCAGAAGCAA AAGCTGGCCC TGGCTGCTAT ACTGGCCATG
CAGCCGCGGG TGCTGGTTTT AGATGAGCCT ACCTCCCAGC TGGACCCGGT AGCGGCCGAG
GAATTTTTTA ATCTCATTAA ACGGTTAAAT GAGGAAATGG GCCTGACCAT AATTTTGATC
GAGCAGCGGC TGGAGAGGTG TTATCACCTG GCCGACCGGG TAGTGTTCAT GGAGGACGGC
CAGCTCAAAT ATGAGGGCAC GCCGGAGCAA CTGGCCCGCT GGGCGGTGCA GCGGGACATC
CCCTTTGTAC CCCCGGTGGC CCGTTTTTTT GCCCGGATAG GTTTCCCTTC TATTCCCGTT
ACCGTCAAGG AAGGGCGCCG GTTACTGCGG TCCAACTTTG ACCGCCGGGA GTTTCCCCCT
CTAAAGCCGG CGGTAAAGGC AGAACCGGGA GAACCGGTTT TGACCATGAG TAAGGTATGG
TTTACCTATC CCAATGGTAA AGAAGCCCTG CAGGACGTAA GTATCCAAAT CGCTACCGGC
GAACTGGTAG CTATCCTGGG CGCCAACGGC GCCGGTAAAT CCACCCTCCT GAAAACCATG
GCCGGCCTCT TAAAACCGGG ACGGGGCCGG GTGCAGGTAA TGGGCCGCGA CCTGAGTAAC
GAGGGCCGGC CCGGGGACGG CAGGATTGCC TACCTTTCCC AGAATCCCGG TGATTATCTC
TTCCAGGATA CCGTGGAAGA GGAATTGTTA TTTACTCTAA AAAATTTCGG CCTCCCTAAT
GACGGCATTG TTGATGAACT CCTGGAGAAG TTAAACCTAC AGCGCTACCG GCGGGTAAAC
CCGCGCGATT TGAGCAGCGG CGAGCGCCAG CGGGTCGCCC TGGCCTCCAT TTTGGTAACA
CGGCCCCGGC TCCTGGTGCT TGACGAACCT ACCCGGGGGA TGGATTATCG CTTGAAGGAC
GAACTGGGAG AATTGTTGAC GGGCTTAAGG AGGGAGGGAG TAAGTGTGGT GCTGGTGACC
CATGATATAG AATTTGCTGC TGCTTATGCC ACGCGGGTGC TGCTGCTGTT TGCCGGCCGG
ATCGTAGCCG ATGGGCCCAA GCACCAGGTC CTGGGCCAGT CGGTTTTTTA TTCCACCCAG
ATTGGCAAAA TGTGCCGCGG CTATGTTGAC GGTGTCCTGA CCCTGCAGGA TGCCCTGGAC
CGGCTGGCAC CCGCATGGCC GGCCAGGCAG GTAGTTTCAT AA
 
Protein sequence
MPLFQSENLI YYYPDREKPA LKDINLRIEE GEFLLITGGS GSGKSTLARV LAGLIPDFYG 
GRFGGKVYFQ GRDMGQMNRR KLAREVGMVF QDPEKQLVMT SVEAEIAFGL ENLGLPRAEM
SRRVAEVLSF LDLTEVRQEF TAHLSGGQKQ KLALAAILAM QPRVLVLDEP TSQLDPVAAE
EFFNLIKRLN EEMGLTIILI EQRLERCYHL ADRVVFMEDG QLKYEGTPEQ LARWAVQRDI
PFVPPVARFF ARIGFPSIPV TVKEGRRLLR SNFDRREFPP LKPAVKAEPG EPVLTMSKVW
FTYPNGKEAL QDVSIQIATG ELVAILGANG AGKSTLLKTM AGLLKPGRGR VQVMGRDLSN
EGRPGDGRIA YLSQNPGDYL FQDTVEEELL FTLKNFGLPN DGIVDELLEK LNLQRYRRVN
PRDLSSGERQ RVALASILVT RPRLLVLDEP TRGMDYRLKD ELGELLTGLR REGVSVVLVT
HDIEFAAAYA TRVLLLFAGR IVADGPKHQV LGQSVFYSTQ IGKMCRGYVD GVLTLQDALD
RLAPAWPARQ VVS