Gene Information |
Locus tag | Athe_1826 |
Symbol | |
ID | 7408940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1901658 |
End bp | 1902644 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716203 |
Product | ATP:guanido phosphotransferase |
Protein accession | YP_002573692 |
Protein GI | 222529810 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3869] Arginine kinase |
TIGRFAM ID | |
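The length fields in the record above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record (Start bp, End bp).
length = gene_length_bp(1901658, 1902644)
print(length)                     # 987, matching "Gene Length"
print(protein_length_aa(length))  # 328, matching "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.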
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000574552 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGATA TTGTTATTAC AAGCAGAATA AGACTTGCAA GGAATCTTTC TGATGTGCCT TTTACCATAA AGATGAATGA CTATGATGCC TCAAATGTTA TAGACAGGGT AAGGGATGTA ATTTTGAAAA ACAAGCAGTA TCATTTTGAG TTTTTTGAGA TAAAAAAGCT ACCTCTTATA AAACGGCAGG TGTTAATAGA AAAACATTTA ATATCACCCG CACTTGCATC ATCAAAAATA AAAAGTGCTG TGGCAATTGA CCAAAATGAG AATATCAGCA TTATGATAAA TGAAGAGGAC CATTTAAGAA TTCAGGTTCT TTATAGGGGA CAGCAGATAC AAAAAGCATG GGAAGATGCA AACAGGATTG ACGACTTTTT GGAACAGCAT CTTCCTTATG CCTATGACGA AACATGGGGG TATCTTACTT CATGTCCTAC AAATGTTGGA ACCGGTTTGA GAGCATCTTT TATGCTTCAT CTTCCTGCTT TGACACTTTT GGGATACATG AAAGGAATAA TTGACACAAT AACAAAGTTG GGGATTGCAG TGAGAGGATT TTATGGTGAG GGAAGTGAGG CAGCAGGAAA TCTTTATCAA ATTTCTAATC AGATTACCTT AGGTCAGCCT GAAGAAGACA TTATAGCAAA TGTAATTTCA ATTACAAACC AGATAATAGA GCAGGAACAG CAGGCAAGAT TGAAGCTTTT GAGCGAAAAC AGAGCTTTTG TCGAAGACAA GGTTTACAGA GCATATGGAA TTTTAAAATA TGCAAGGAAT ATCTCTTCAA ACGAAGCTTT AAAGCTCATA TCTGATGTGA GAATGGGTAT CAGTATGGGT ATAATTAAAG AAACAACAAT TGACAGGTTG GATGTGCTTT TGAATTTAAT TCAGCCAGCC ATTATCCAAG ACTACTTTGG CAGGGAAATG ACACCAGAGG AAAGAGACAT AAAAAGAGCA GAACTTATAA GAAAAATTTT GGAATAG
Protein sequence | MNDIVITSRI RLARNLSDVP FTIKMNDYDA SNVIDRVRDV ILKNKQYHFE FFEIKKLPLI KRQVLIEKHL ISPALASSKI KSAVAIDQNE NISIMINEED HLRIQVLYRG QQIQKAWEDA NRIDDFLEQH LPYAYDETWG YLTSCPTNVG TGLRASFMLH LPALTLLGYM KGIIDTITKL GIAVRGFYGE GSEAAGNLYQ ISNQITLGQP EEDIIANVIS ITNQIIEQEQ QARLKLLSEN RAFVEDKVYR AYGILKYARN ISSNEALKLI SDVRMGISMG IIKETTIDRL DVLLNLIQPA IIQDYFGREM TPEERDIKRA ELIRKILE
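As a rough illustration of how the sequence fields relate to each other, the following sketch computes GC content and translates a nucleotide sequence written in the space-separated 10-base blocks used above. It is a minimal standalone example, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical. It also assumes the gene sequence is already shown in coding orientation, since it begins with ATG even though the gene lies on the minus strand.

```python
# Codon table built in the conventional TCAG order. Amino-acid assignments
# for translation table 11 match the standard code; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Only unambiguous A/C/G/T codons are handled; anything else raises KeyError.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
prefix = "ATGAATGATA TTGTTATTAC AAGCAGAATA"
print(translate(prefix))  # MNDIVITSRI, the first block of the protein sequence
```

Passing the full Gene sequence value to `gc_content` and `translate` should allow the GC content and Protein sequence fields to be checked in the same way.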