Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_1745 |
Symbol | |
ID | 5161557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | - |
Start bp | 1925167 |
End bp | 1926537 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640553662 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_001234868 |
Protein GI | 148260741 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00942929 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCGAC TCTGGCTCGA ACAGGCTTTT CTCCCCGAAG GATGGGCATC CGGCGTCGCG CTCGACATCG ACGGGTCCGG CCGCATCGCC GGCCTCGCCG CGAATGCCCC GCCGGCCGAT CGCGAAACCG TCCCCGGCTG CGTGCTGCCC GGCCTGCCCA ACCTGCACAG CCACGGCTTC CAGCGGGCGA TGTCCGCGCT CGCCCAGCGC CGCCGCGCCG CCGCCGAGGA TTTCTGGTCC TGGCGCGGCG TGATGTACCG CACCGCCCTC GCGCTGCCGC CGGACGACAT CGCCGCCGTC ACCGCCATGG CCTTCATGGA AATGCTCGAG CGCGGCTTCA CCGCGGTCGC CGAATTCCAT TACCTGCACA ACGATTCCGC CGGCGCCCCC TATGCCGATC CCGCCGAGCT CGCCACGCGG GTGATCGAGG CGGCGGCGCA AACCGGCATC GGCCTCACCC TGCTGCCGGT GCTCTACAGC GCCGCAGGCC CCAACCGCCC GCCCGAGCCG GGCCAGCGCC GCTTCATCAC CGATCTCGAC GGGTTTCTGC GCCTGCACGC GGCCACGGCG GAACGGCTGC GCGCCCTGCC GGGCGCCGTG CTCGGCGCCG CGCCGCACTC GCTGCGCGCC GCCCGCGCCG CCGATGTCGC CGCCCTGTCC GGCCTGCTTC CCGAAGGCCC GCTCCACATC CACGCCGCCG AGCAGACCGG CGAGGTCGCC GAGGTCGAGG CCGCGCTCGG CGCCCGCCCG GTCGAATTCC TGCTCCGCGA GGCGGGGGTC GACGCCCGCT GGTGCCTGAT CCACGCCACC CACATGGCGG CCGAGGAAAC CGCCGGCCTC GCCCGCGCCG GCGCCGTCGC CGGCCTCTGC CCGATCACCG AGGCCGATCT CGGCGACGGC ATCTTCAATG GAAACACCTT CATCGATGCC GGCGGGCGTT TCGGCGTCGG CACCGATTCG AATGTCGCGA TCGAGGCCCC CGCCGAGCTG CGCCAGCTCG AATGGAGCCA GCGCCTGCGC GACCGTCGCC GCCTCGTCCT CGCCGACCCG GCGCGCGACG GCGGTTCGAC CGGCGCCGCC CTCTACCGTG CGGCCCTCGC CGGCGGCGCG CAGGCCCTGG CCCAGCCCAT CGGCGCCATC GCCCCCGGCT GCCGCGCCGA TCTCGTCTCG CTCCGCCGCG CCGGCACCGA CCTCGCCGCG CTCTCCGCCG AAACCCGGCT CGATCACTAC ATCTTCGCCG GCGGCGCGCG GCTGGTCGAC CGTGTCTACG TCGCCGGGCG CTGCCTTGTG CGCGAAGGCC GGCACGATGC CCGCGCCGCC ATCGAATCCC GCTTCGCCGC GGCCCTCCGC CGCCTGGAGG ACGCGCTCTG A
|
Protein sequence | MRRLWLEQAF LPEGWASGVA LDIDGSGRIA GLAANAPPAD RETVPGCVLP GLPNLHSHGF QRAMSALAQR RRAAAEDFWS WRGVMYRTAL ALPPDDIAAV TAMAFMEMLE RGFTAVAEFH YLHNDSAGAP YADPAELATR VIEAAAQTGI GLTLLPVLYS AAGPNRPPEP GQRRFITDLD GFLRLHAATA ERLRALPGAV LGAAPHSLRA ARAADVAALS GLLPEGPLHI HAAEQTGEVA EVEAALGARP VEFLLREAGV DARWCLIHAT HMAAEETAGL ARAGAVAGLC PITEADLGDG IFNGNTFIDA GGRFGVGTDS NVAIEAPAEL RQLEWSQRLR DRRRLVLADP ARDGGSTGAA LYRAALAGGA QALAQPIGAI APGCRADLVS LRRAGTDLAA LSAETRLDHY IFAGGARLVD RVYVAGRCLV REGRHDARAA IESRFAAALR RLEDAL
|
| |