Gene Moth_0097 details

Gene Information

Locus tag: Moth_0097
Symbol:
ID: 3832668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 94586
End bp: 95572
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 61%
IMG OID: 637828029
Product: NLPA lipoprotein
Protein accession: YP_428979
Protein GI: 83588970
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
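
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(94586, 95572)
print(length)                     # 987
print(protein_length_aa(length))  # 328
```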


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA GGGGACTGGC CCTTCTTTTA CTCCTACTCT TCCTGGTCCC TGTCGTATCT 
GGCTGCGGCA GCCCGGCTAG CAGCGGGAGC AGCCAGGAAG AGAGCCTGAA GCTGGGGTTA
ATCCCGGTGG AGGATAACTT CCCCTTTTTT GTCGCCGAGA AGGAAGGCCT CTTTACCAAA
GCCGGCTTGA AGGTTGAACT GGTACCCTTT AATAGTGCCC GGGATCGCGA TCTGGCCCTG
CAGTCCGGGA GTATCGACGG CGAGGTGGCC GATATTGTCG CTACGGCCCT GCTACGTAAA
GGCGGAACGC CGGTGAAGAT CGTCTCCCTG ACCATGGGGG CCACCCCGGC CGAGGGACGC
TTCGCCCTCC TGGCCCGGCC CGGGGCCGAT ATCAGCTCCC CCGGCCAGCT CAAAGGCCGG
ACCGTCGGCA TCTCGGAAAA CACCATCATC GAGTATGTCG CTGACGGCCT CCTGCGGGAA
GGAGGGGTAG ACCCCGGCTC CGTCCAGAAA GTCGCCGTAC CCCAGATCCC GGAACGGCTC
CAGCTCTTGC TGGGTGGTAA GTTGGACGCC GCCCTGCTGC CTGATCCCTT TGCTTCCCTG
GCCGCCAGGA AAGGGGCCAG GGTGATCCTG GACGATACGA AAATTAACCG CAATCTCTCC
CAGGTGGTAC TTATCTTCCG GGAGGAAGCC ATCAAACATA AGACACCGGC TATTAAGAAG
CTACTCCAGG TATATGCCGG GGCCGCGAGC TTGATTGCCC GGAACCCCTC CGCCTACCGG
GAGCTATTTA TTGAAAAGGC CAGGATACCG GCGGAACTCC GGGACACCTA CCTGGCGCCC
CAATACTCCC CGCCGCAACT GCCCCGGCAG GAGGAGGTCG CGGCGGTGAT GGACTGGATG
GTGGCCAAAA AACTCCTGGC CGCACCCTAT AAATACGAAG AGCTGGTTGA CCCGGATTTG
GTTAACCCCG GTGGGAACAA CCGGTGA
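
A GC content figure like the one reported for this gene can be recomputed from a sequence laid out in 10-base blocks as above. This is a minimal sketch; `gc_fraction` is a hypothetical helper name, and the usage example runs on the first two blocks only rather than the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 10 of 20 bases are G or C.
print(gc_fraction("ATGAAAATCA GGGGACTGGC"))  # 0.5
```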
 
Protein sequence
MKIRGLALLL LLLFLVPVVS GCGSPASSGS SQEESLKLGL IPVEDNFPFF VAEKEGLFTK 
AGLKVELVPF NSARDRDLAL QSGSIDGEVA DIVATALLRK GGTPVKIVSL TMGATPAEGR
FALLARPGAD ISSPGQLKGR TVGISENTII EYVADGLLRE GGVDPGSVQK VAVPQIPERL
QLLLGGKLDA ALLPDPFASL AARKGARVIL DDTKINRNLS QVVLIFREEA IKHKTPAIKK
LLQVYAGAAS LIARNPSAYR ELFIEKARIP AELRDTYLAP QYSPPQLPRQ EEVAAVMDWM
VAKKLLAAPY KYEELVDPDL VNPGGNNR
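
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons of the gene sequence.
print(translate("ATGAAAATCA GGGGACTGGC C"))  # MKIRGLA
```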