Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_5022 |
Symbol | |
ID | 5420503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009717 |
Strand | + |
Start bp | 241445 |
End bp | 242491 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640873676 |
Product | carbonic anhydrase |
Protein accession | YP_001409456 |
Protein GI | 154243883 |
COG category | [R] General function prediction only |
COG ID | [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.585067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGT CTCCCTTCGT CCTGCCTTAT CACGGCATCT TGCCGGTCTA TCCGGACCTG CTGCGTGCGG GTCCGCGCGC GGCGCTCATT GGCCGGGTCA CGCTTGGGCC CCGCGCCGCG CTCGGCACCC TCGCGCTTAT CCGAGGCGAC GGCCACGTTG TGGAGATCGG CGCGGACTTC CATCTCGGCG ACTGGGGCAC GGTCCATATC GCCCACGAGA TGCATCCAGC GATTGTCGGG GATCGGGTCA CCGTTGGACC AGACGCGGTG GTCCATGCCT GCACGGTCGG CGATGACTGC GTGATCGAGG AGGATGCCAT CATCCTTGAC GGATCCGTCC TGGAGAACGG CGTCGTCATG GAGGCGGGCA CCATCGCCTT TCCCCGGTCG AGGCTCGAGG CGGACACCCT CTACGCCGGC GCGCCGGCAA AGCCCGTGCG CAGGATCGAT GCGGCCGAAC GCGCTGCGCG GGCGGCGCGC ATCCGCGCCC TCGCCAGCGC GCCTCCGCCT CCGCCGCCGG CCGGCGGCGC CGCACGGCTC GATATTCACT CCACCACTTT CATCGCCGCC AACTCCTCCG TCTCAGGCGC CTTCAAGGCG GATGCGCATG CGTCGGTGCT GTTCAGCTGC ACCCTCGACG CGCGCAACTC AGAAATCTCG CTGGGCGAGA ATTCCAACAT CCAGGACAAC AGCTTGGTCC GCTGTCCCGA CGGCCCCGTC GTCGTCGCCG CCAATGCCGT GGTCGGGCAC AACGTGGTGC TGGAGAGCTG CACCGTCGGC ACCGGCTCGC TGGTGGGCAC CGGCAGCCGC GTGGCGCCCG GCACCGTCGT GGAGCCGGAC GTCCTGCTCG CGGCCGGAGC CCGCACGCAG AGGGGCCAGG TGCTGGAAAG CGGCTTCCTG TGGGGCGGAA ATCCAGCGCG CAGGATCGCG CCGCTCGACG ACAAGAAGCG CCAGATGATC CCCTGGATCA TCTCCACCTA TTGCGAGTAC ACCGCCGACT TCCTCGCGAG CCAGCATCGC GCGGCTCAGC GGGCCAGCTT CGGCTAA
|
Protein sequence | MSASPFVLPY HGILPVYPDL LRAGPRAALI GRVTLGPRAA LGTLALIRGD GHVVEIGADF HLGDWGTVHI AHEMHPAIVG DRVTVGPDAV VHACTVGDDC VIEEDAIILD GSVLENGVVM EAGTIAFPRS RLEADTLYAG APAKPVRRID AAERAARAAR IRALASAPPP PPPAGGAARL DIHSTTFIAA NSSVSGAFKA DAHASVLFSC TLDARNSEIS LGENSNIQDN SLVRCPDGPV VVAANAVVGH NVVLESCTVG TGSLVGTGSR VAPGTVVEPD VLLAAGARTQ RGQVLESGFL WGGNPARRIA PLDDKKRQMI PWIISTYCEY TADFLASQHR AAQRASFG
|
| |