Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3659 |
Symbol | |
ID | 4898307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 760203 |
End bp | 761246 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640114267 |
Product | carbonic anhydrase |
Protein accession | YP_001045521 |
Protein GI | 126464408 |
COG category | [R] General function prediction only |
COG ID | [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0725502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.181373 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGAGA AATGTCCGCC CCTGATCGCC GCCTACCACG GCATGTCGCC GGAGTTCGCG GGCGAGCCCG CCTTTGCGGG CGCGGGCGCC GCCGTCCTCG GGCGCGCGAG GCTCGGCCGG GGCCTCTGGC TCGGCGCGCG CTCCGTGATC CGGGCCGACG GGCATCACAT CCATGTCGGC GACGACTTCC ACCTCGGCGA GGGGGCGACC GTCCATATTG CGCACGATGT CTATCCGACG CATGTCGGCC AGAATGTCAC CGCCGGCAAG GGTGCGGTGA TCCATGCCTG CACCATCGGC GACAATTGCG TGATCGAGCG GGGGGCCGTC ATTCTCGACG GGTCCGAAGT GGCCGACGGA GTGGTCGTGA CGGCGGGATC GGTGGTCTTT CCGCGCTCGA AGCTCGAGGC GGGCTGGCTC TATTCCGGCA GCCCGGCGCA GCGCGTGGCC CGCGTCTCGG CCTCCGAGCT CGCCTCCTAC CATCAGCAGA CGCGGAACGA CCTCTCCTCC GGAAAGGCCG GCCCGGCGGG CGACGGGGCA GGTCGGGGCC ATGTCTTCGT GGCCCCCACG GCGACCTTGG CCGGCCGCGT CACCATGGAG GAGGGGGTGG GCGTCTGGTA TGGCTGCCGG CTCGAGGCGG GCAGCCACGA GATCCGCATC GGCGAGGGCA CCAACGTGCA GGACAACAGC ACGATCCTCT GCGAGACGCG GGACGTGGAG ATCGGCCCCG ACGTCACCAT CGGCCACAAT GTCCTCCTCG TCGATTGCCG GGTCGAGCGG GCGAGCCTCG TCGGCATCGG CTCGCGCATC GCGGCCGGCA CCGTGATCGA GAGCGACGTG CTCGTCGCGG CCGGCACCGA GACCGAACCC GACCAGCGCC TCACCGGCGG CAAGGTCTGG GCGGGGCGGC CCGCGCGCCC GATCGCCGAC ATGACCGACG CGCGCCGCGG CATGCTGGCC GCGACCCTGC CCATGTATCG CGACTATGCC ACGCAGTTCG CCGGCACGTC GCACCAGCCG ATGCTTCAGC CCGGGGAGGA GTGA
|
Protein sequence | MHEKCPPLIA AYHGMSPEFA GEPAFAGAGA AVLGRARLGR GLWLGARSVI RADGHHIHVG DDFHLGEGAT VHIAHDVYPT HVGQNVTAGK GAVIHACTIG DNCVIERGAV ILDGSEVADG VVVTAGSVVF PRSKLEAGWL YSGSPAQRVA RVSASELASY HQQTRNDLSS GKAGPAGDGA GRGHVFVAPT ATLAGRVTME EGVGVWYGCR LEAGSHEIRI GEGTNVQDNS TILCETRDVE IGPDVTIGHN VLLVDCRVER ASLVGIGSRI AAGTVIESDV LVAAGTETEP DQRLTGGKVW AGRPARPIAD MTDARRGMLA ATLPMYRDYA TQFAGTSHQP MLQPGEE
|
| |