Gene Moth_0161 details

Gene Information

Locus tag: Moth_0161
Symbol:
ID: 3831873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 161222
End bp: 162274
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 67%
IMG OID: 637828100
Product: ATP:guanido phosphotransferase
Protein accession: YP_429042
Protein GI: 83589033
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3869] Arginine kinase
TIGRFAM ID:
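
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 161222
end_bp = 162274

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons, then subtract one for the assumed terminal stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1053 350"
```

Both values agree with the Gene Length and Protein Length fields in the record.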


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTGA ACCTGGAGCG CAACAGCAAA TGGATGGAGG GCTCCGGCCC CCAGGCCGAT 
ATCGTCATTT CCAGCCGCAT CCGCCTGGCC CGCAACCTTA AAGGCCTGCC CTTCCCCAAC
CTCATGAACA GCGACCAGCA GGCCCGGGTA GTCAGCCAGG TGAGCCGGGC CATCCAGGCG
CCCATGGTCT TCCAGGCCGT GGGGGAGCTG AAACTCCAGC CCCTGCGGGA ATTGGCGCCG
GTGGAGCGGC AAATACTAGT AGAAAAGCAC CTCATCAGTC CCGACCTGGC CGCAGGCGGC
GGCGAAAAGG CCGTTGTCCT GCGGGACGAC GAGGCCGTCA GCATCATGGT CAACGAGGAA
GACCACCTGC GGCTGCAGTG CCTCCTGCCG GCCATGATGC TCCACGAAGC CTGGCGCCTG
GCCGACGCCG CCGACGACGC CCTGGAGAAC GAGCTGGACT TCGCCTTCGA CCAGGAGCGC
GGCTACCTGA CCGCCTGCCC GACCAACGTC GGCACGGGCC TCAGGGCTTC CACCATGCTC
CACCTGCCGG CCCTGGTCCT GACCAGGCAG GCGGGGCCGG TGCTTTCGGC CCTGACCAAG
GTGGGGGTGG CCGTCCGGGG CCTCTACGGC GAGGGCACCG AGGCCCAGGG CAACATCTTC
CAGGTGTCCA ACCAGATCAC CCTGGGCCGT TCTGAAGAGG AAATTATCAA CAACCTCTCG
GCGGTGACGG TGCGCCTGGC CGATCAAGAA CGGGAGGCCC GGGAGCTCCT GCGCCGCCAG
AGCCGCTGGC AGCTGGAGGA CCGGGTTGGC CGGGCCTACG GCGTCCTGAC CAACGCCCGG
ATCTTGAGCT CCCAGGAAGC CCTGCAGCTC CTCTCCGACG TGCGCCTGGG GGCGGAGATG
AAGATCATCC GCGGCCTGGA CCAGCGCCTG CTGAATCAGC TCATGGTCCG CATCCAGCCG
GCCTTCCTCC AGTTTAGCGC CGGCAAAGAG ATGACGCCCA TGGAGCGCGA CGTCCACCGG
GCGGCCATGG TCCGGGAATT GCTGGCAGGC TAA
 
Protein sequence
MSLNLERNSK WMEGSGPQAD IVISSRIRLA RNLKGLPFPN LMNSDQQARV VSQVSRAIQA 
PMVFQAVGEL KLQPLRELAP VERQILVEKH LISPDLAAGG GEKAVVLRDD EAVSIMVNEE
DHLRLQCLLP AMMLHEAWRL ADAADDALEN ELDFAFDQER GYLTACPTNV GTGLRASTML
HLPALVLTRQ AGPVLSALTK VGVAVRGLYG EGTEAQGNIF QVSNQITLGR SEEEIINNLS
AVTVRLADQE REARELLRRQ SRWQLEDRVG RAYGVLTNAR ILSSQEALQL LSDVRLGAEM
KIIRGLDQRL LNQLMVRIQP AFLQFSAGKE MTPMERDVHR AAMVRELLAG
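
The relationship between the gene and protein sequences above can be spot-checked by translating the first and last stretches of the gene. The sketch below is an illustration only: it builds the codon table from the standard NCBI amino-acid ordering (translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons), and the helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (spaces ignored)."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 33 bases, copied from the gene sequence above.
first_30 = "ATGAGCCTGA ACCTGGAGCG CAACAGCAAA"
last_33 = "GCGGCCATGG TCCGGGAATT GCTGGCAGGC TAA"

print(translate(first_30))  # MSLNLERNSK
print(translate(last_33))   # AAMVRELLAG*
```

The two translated fragments match the first and last ten residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon. Calling `gc_fraction` on the full gene sequence would give a value to compare against the GC content field.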