Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1531 |
Symbol | |
ID | 7318018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1640946 |
End bp | 1642265 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643616422 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_002513602 |
Protein GI | 220934703 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACCA TCGACACCCT CATCAAGGCA CGCTGGATCA TTCCGGTGGA GCCCGACGAC ACCGTCCTCG AACACCATGC CCTGGCCATC CGCGCCGGGC GCATCGTGGC GCTCCTGCCC TCGGCCGAGG CCGATGACCG CTACCGGGCG GACAAGGTCC ATGAACTGCC CCACCACGCC CTGATCCCGG GGCTGGTGAA CACCCACACC CATGCCGCCA TGAGCCTGAT GCGCGGTCTG GCCGATGATC TGCCCCTGAT GGAATGGCTC AAGGGCCACA TCTGGCCCGC CGAGGGCCGC TGGGTGGGCG CGGAGTTCGT GGAGGACGGC ACTCTGCTGG CCATGGCGGA GATGCTGCGC GGCGGGGTGA CCTGCTTCAA CGACATGTAC TTCTTCCCGG AGATCACCGC CCACGCCGCC GCGCGGGCGG GCATGCGCGC GGCACTGGGC CTGATCGTCA TCGATTTCCC CACCGCCTGG GCGGCCAATG CGGATGAGTA CATCGCCAAG GGCCTGGCCC TCTACGACGA CCACAAGGAC GAGGCCCTGC TGTCATTCTG CTTCGCGCCC CACGCGCCCT ATACCGTCTC CGACGAGCCC CTCAAGCGCA TCCGGACCCT GGCCAATGAG CTGGACCTGC CGGTACACAT GCACGTGCAC GAGACCGCCC ACGAGGTGGA AGAATCCATG GCCCGCTTCG GCATGCGCCC GCTAGAGCGC CTGGCGCAGC TGGGCCTGGT GGGCCCCAAC CTGCTGGCCG TGCACATGAC CCAGCTGGAA GATGCCGAGA TCGCCCACCT GGCCGAGGCT GGCGCCCACG TGCTGCACTG CCCTGAATCG AACCTCAAGC TGGCCAGCGG TTTCTGTCCG GTCCAGAAAC TCCTGGATGC CGGGGTGAAC GTGTGCCTGG GCACCGACGG CGCCGCCAGC AACAACGACC TGGACCTGAT GGGCGAGATG CGCACCGCCG CGCTGCTGGC CAAGGGGGTG GCCGGGGATG CCGCCGCCCT GCCTGCCGCT GCGGCCCTGC GCATGGCGAC CTTGAACGGC GCCCGGGCCC TGGGGCTCGG CGAGGAGACC GGCTCCCTGG TGCCCGGCAA GGCAGCGGAC GTGGTGGCCG TGGATCTCGG CGCCCTCGAA AGCCGCCCCG TGTATCACCC CGTCTCACAC CTGGTCTATG CCACGGGCCG ACAGCAGGTG ACCCATGTCT GGGTGGCCGG CAAGGCCCTG CTGAAGGACC GCCGCCTGAC GACCCTGGAC CTGGAGGCGA TCCAGGCCAG GGCGATGGCG TGGCAGGAGC GCTTGAAGGC AAATGCATGA
|
Protein sequence | METIDTLIKA RWIIPVEPDD TVLEHHALAI RAGRIVALLP SAEADDRYRA DKVHELPHHA LIPGLVNTHT HAAMSLMRGL ADDLPLMEWL KGHIWPAEGR WVGAEFVEDG TLLAMAEMLR GGVTCFNDMY FFPEITAHAA ARAGMRAALG LIVIDFPTAW AANADEYIAK GLALYDDHKD EALLSFCFAP HAPYTVSDEP LKRIRTLANE LDLPVHMHVH ETAHEVEESM ARFGMRPLER LAQLGLVGPN LLAVHMTQLE DAEIAHLAEA GAHVLHCPES NLKLASGFCP VQKLLDAGVN VCLGTDGAAS NNDLDLMGEM RTAALLAKGV AGDAAALPAA AALRMATLNG ARALGLGEET GSLVPGKAAD VVAVDLGALE SRPVYHPVSH LVYATGRQQV THVWVAGKAL LKDRRLTTLD LEAIQARAMA WQERLKANA
|
| |