Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1790 |
Symbol | |
ID | 4810035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2113866 |
End bp | 2114888 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107204 |
Product | ATP:guanido phosphotransferase |
Protein accession | YP_001038204 |
Protein GI | 125974294 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3869] Arginine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000255065 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAT GGTACATGCA AATAGGTCCT GAATCAGACG TGGTAATGAG CACGAGAGTG CGAATAGCAA GGAACTTTAA CGGGATTCCT TTCCCGTCCA AAATGAAAAG GGAAGACGGG AAATTGGTAA TAAAAAAGGT TAAGGAAGCA ATTTTCGGAA GAAGTTCCGT TGCAAATAAT TTCAGATTTA TAGATATTCA TGAATTGACA CCGATTCAAA GACAGGTGCT GGTGGAAAAA CATCTTATAA GCCCGGACCT GGCGGAAAGC CGTATTGAAA GCGGAGTGAT AATCAGCGCG GAAGAAAATA TAAGCATAAT GATTAATGAG GAAGATCATC TCAGGATACA ATGTCTGGCT GCAGGTTTGC AGCTTGAAGA TACATGGAAC CTGTGCAACC AGATAGACAA CCTGCTGGAG GAAACCATTG ATTATGCTTT CGACGAAAAA TTCGGATATC TTACCTGTTG TCCCACCAAT CTGGGGACCG GTATAAGGAC TTCGGTAATG CTTCATCTTC CCGCTCTTAC CATGACAGGA TACATAAAAG GTATTCTTGA GGCATGCACA AAGCTTGGAA TTGCGGTAAG AGGGCTTTAC GGCGAGAATT CCGAAGCGTC GGGAAACATG TATCAGATAT CCAACCAGGT TACTCTGGGA CTTACCGAGG AAGAGATAAT TTCCAACATC AACAACATCG CAAAACAGAT AATCGATCAG GAGAGAAACT TAAGAAAACA GCTTTACAAG CAAAATACCT ACAGATTTGA GGACAGGATT TTCCGTTCTT TGGGTCTTTT GTCCAACGCG AGGATAATGA CATCGGAAGA GGGATTAAAA TTATTGTCCG ATGTTAGATT GGGAGTTGAT ATGGGAATAA TTACAGATAT AGACATCAGA AAGCTTAATG AAATACAACT CCTGGTACAG CCGGCAAACC TGCAGGAGAG CGTGGGACGG CCAATGAATC CGGAAGAAAG GGATATAAAG AGGGCGGAAA CCATAAGGAA CAAACTAAGG TAA
|
Protein sequence | MNEWYMQIGP ESDVVMSTRV RIARNFNGIP FPSKMKREDG KLVIKKVKEA IFGRSSVANN FRFIDIHELT PIQRQVLVEK HLISPDLAES RIESGVIISA EENISIMINE EDHLRIQCLA AGLQLEDTWN LCNQIDNLLE ETIDYAFDEK FGYLTCCPTN LGTGIRTSVM LHLPALTMTG YIKGILEACT KLGIAVRGLY GENSEASGNM YQISNQVTLG LTEEEIISNI NNIAKQIIDQ ERNLRKQLYK QNTYRFEDRI FRSLGLLSNA RIMTSEEGLK LLSDVRLGVD MGIITDIDIR KLNEIQLLVQ PANLQESVGR PMNPEERDIK RAETIRNKLR
|
| |