Gene Moth_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1952 
Symbol 
ID3832302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2028770 
End bp2030287 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content61% 
IMG OID637829883 
ProductABC transporter related 
Protein accessionYP_430793 
Protein GI83590784 
COG category[R] General function prediction only 
COG ID[COG3845] ABC-type uncharacterized transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00677494 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGTAC CGCTGGTTGA AATGCGGGGG ATAACCAAGG TTTTTCCGGG CGTGGTGGCC 
AACGAGGGGG TCAACCTGGC CGTCCACGCT GGGGAAATTC ACGCCCTGCT GGGGGAGAAT
GGGGCGGGAA AAAGTACCCT GATGAGTGTT CTTACGGGCC TGTACCGGCC CGATGGCGGG
GAGATCTACC TGGACGGCCG GAGGGTCAAT TTCCGCTCGC CCCGGGACGC CATTGAGGCG
GGTATCGGCA TGGTCCACCA GCACTTTCGG CTGGTGGCGC CCTTCACGGT AACGGAGAAT
GTGGCCCTGG GGCTTAAAGG CGGCCTTAAA CTCAACGTAA ACCGGCTGGC CGGGGAAATA
GCCGCGCTTT CTAAAGAATA CGGCCTCCAG GTGGACCCCC AGGCGCGGAT CTGGCAGCTT
TCAGTGGGGG AACAACAGCG GGTCGAGATC ATCAAGCTCC TGTACCGGAA GACCCGGGTC
CTGATTCTCG ATGAACCGAC GGCCGTCCTG ACCCCCCAGG AAGCCCGGGA TCTGTATCGG
ACCTTGAAAA GAATGGCTGC CCAGGGCTGC GCCGTGATCT TCATTACCCA TAAACTCCAG
GAAGTTATGG ACGCCGCCGA TACCATTACT ATTCTCCGGG GTGGCAAGAC GGTGGCCACA
GTGAAAAAGA GTGAAACCAG CGAAAAGGAA CTGGCCCGGA TGATGGTCGG CCGGGAGATT
GTCTGGCAGG GGGATAAGCC TGTAGCTAAA AAGGGCGAAA AGATCCTGGA GATCAGAGAT
TTAAGGGCTC TAAACGATAA AGGCCTGCCA GCTCTACGGG GTATCAACCT GGAGGTCTTT
GCCGGTGAGA TCCTGGGTAT TGCCGGGGTG GCCGGCAACG GCCAGCGGGA GTTGGCCGAG
GTCATCGCCG GCTTGCGGCC CTGCCAGGGC GGCAGCATTA CCGTTTCCGG GCAGGAACTA
GGCCAGTGCG ACCCCTGCCG GGTAATCCAG GCCGGAGTGG GCTATATACC CGAAGATCGC
CTGGGTACGG GCCTTATACC CAACCTGGGA GCTGTCGACA ACCTGCTCCT GAAGGAATAC
CGCCATCCCC GCTGGGGTAG GACCATCATG AACCGGAGGG CAGCGCGCCA GTGGGCCGGC
GAACTGGTGC AACACTTCAA AGTAAAGATG GCCGGTCTGG ACGCCCCGGT GAAAATGATG
TCCGGCGGCA ACCTGCAGCG GCTGCTCCTT GCCCGCGAGA TTTCCTCCCG GCCGCGCCTG
TTGGTGGCCG TCTACCCGGC CCGGGGCCTG GACGTTGGCG CTACGGAAAC GGTCCACCGC
CTGCTCCTGG AGCAGCGGGC CGCCGGTACG GCCATTCTTC TGATTTCCGA GGACCTGGAG
GAGCTCTTCC GCCTGGCCGA CCGCATAGCC GTAATGTATG AAGGCCAGAT TATGGGCTTG
ATGCCCACTG AAAAGGCCAG CGTGGAGGAG CTGGGCCTGA TGATGGCCGG GGCGAAAAGA
ATGGAGGTCG GAGCCTGA
 
Protein sequence
MAVPLVEMRG ITKVFPGVVA NEGVNLAVHA GEIHALLGEN GAGKSTLMSV LTGLYRPDGG 
EIYLDGRRVN FRSPRDAIEA GIGMVHQHFR LVAPFTVTEN VALGLKGGLK LNVNRLAGEI
AALSKEYGLQ VDPQARIWQL SVGEQQRVEI IKLLYRKTRV LILDEPTAVL TPQEARDLYR
TLKRMAAQGC AVIFITHKLQ EVMDAADTIT ILRGGKTVAT VKKSETSEKE LARMMVGREI
VWQGDKPVAK KGEKILEIRD LRALNDKGLP ALRGINLEVF AGEILGIAGV AGNGQRELAE
VIAGLRPCQG GSITVSGQEL GQCDPCRVIQ AGVGYIPEDR LGTGLIPNLG AVDNLLLKEY
RHPRWGRTIM NRRAARQWAG ELVQHFKVKM AGLDAPVKMM SGGNLQRLLL AREISSRPRL
LVAVYPARGL DVGATETVHR LLLEQRAAGT AILLISEDLE ELFRLADRIA VMYEGQIMGL
MPTEKASVEE LGLMMAGAKR MEVGA