Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0908 |
Symbol | |
ID | 7084766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 999770 |
End bp | 1000519 |
Gene Length | 750 bp |
Protein Length | 249 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697931 |
Product | 1-(5-phosphoribosyl)-5-[(5- phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase |
Protein accession | YP_002354571 |
Protein GI | 217969337 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCTCA TTCCCGCCAT CGACCTCAAG GACGGTCAAT GTGTTCGCCT GAAACAGGGC GAGATGGACG ACGCCACGGT GTTCTCCTCC GATCCGGGCG CCATGGCGCG CCACTGGCTC GAGGCCGGCG CGCGCCGCCT GCACCTGGTG GACCTCAACG GTGCCTTTGC CGGCAAGCCG AAGAACGGCG CCGCGATCCG TTCGATCACC GACGTCGTCG GTGACGACAT CCCGGTCCAG CTCGGCGGTG GCATCCGCGA CCTCGACACG ATCGAGCATT ACCTCGACAA CGGCATCAGC TACGTCATCA TCGGCACCGC CGCGGTCAAG AACCCCGGCT TCCTGCACGA CGCCTGCAGC GCCTTTCCCG GCCACATCAT CGTCGGCCTC GACGCCAAGG ACGGCAAGGT GGCGGTGGAC GGATGGTCCA AGCTCACCGG CCACGACGTG GTCGATCTGG CGAGGAAGTT CGAGGACTAC GGCGTGGAGT CGGTGATCTA CACCGACATC GGCCGCGACG GCATGCTTTC GGGCGTCAAT ATCGAGGCCA CCGTGCGTCT GGCGCGCGCG CTGCGCATCC CGGTCATCGC CAGCGGCGGC ATCACCGACC TGCGCGATAT CGACGCGCTG TGCGCGGTCG AGGACGAGGG CATCATGGGC GCGATCACCG GGCGCGCGAT CTACGAGGGC ACGCTCGACT TCGCCGCCGC GCAGGCGCGC GCGGACGAGC TCGAGGGCGC GCGCGAATGA
|
Protein sequence | MLLIPAIDLK DGQCVRLKQG EMDDATVFSS DPGAMARHWL EAGARRLHLV DLNGAFAGKP KNGAAIRSIT DVVGDDIPVQ LGGGIRDLDT IEHYLDNGIS YVIIGTAAVK NPGFLHDACS AFPGHIIVGL DAKDGKVAVD GWSKLTGHDV VDLARKFEDY GVESVIYTDI GRDGMLSGVN IEATVRLARA LRIPVIASGG ITDLRDIDAL CAVEDEGIMG AITGRAIYEG TLDFAAAQAR ADELEGARE
|
| |