Gene Information |
Locus tag | Arth_3333 |
Symbol | |
ID | 4444062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3745422 |
End bp | 3747773 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639691156 |
Product | carbonate dehydratase |
Protein accession | YP_832808 |
Protein GI | 116671875 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0288] Carbonic anhydrase; [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
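The coordinate and length fields above are linked by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the record does not state either convention, although both agree with the reported values. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag Arth_3333.
start_bp, end_bp = 3745422, 3747773
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values match the Gene Length (2352 bp) and Protein Length (783 aa) fields of the record.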
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCAG CACCCGCCAA GACGGACCCG CCCGCACCGC AACGGACCGC CCCATCAGGC CACCACCCCG CTGAAAAGGA TCCCGGGGGC CTTCGGGAAT TCCTGACCAC CGGGCTCCGC TGGGACCTGC CGGCGTCGCT GGTGGTGTTC CTGGTGGCCG TCCCGCTGTC GCTGGGCATC GCGGCCGCCT CCGGCGCCCC GGTGATGGCC GGCCTGATTG CCGCCGCCGT GGGCGGGATC GTCGCCGGCA GCCTGGGTGG TTCCCCGCTG CAGGTCAGCG GGCCAGCCGC CGGACTGACA GTCATCGTTG CCGGCCTGAT TGAACAGTTC GGCTGGCCGG TCACCTGCGC CATCACCGCG GCAGCCGGCG TGCTCCAGGC ACTGCTCGGA CTGGCACGGG TGGGCCGCGT CGCGCTGGCC ATCGCACCGG TGGTGGTCCA TGCCATGCTC GCGGGCATCG GCATCACCAT TGTGCTGCAG CAGCTGCACG TGATGCTCGG CGCCGAGTCG GCCAGCGAGG CCTGGGAGAA CATCATGGGC ATGCCGGGCA GCTTCCTTGC CGCCGACATC GCCGCTGCAG TCCTTGGTGC TGTAGTTATC GCCCTGCTGC TCGGCTGGAA GCACCTTCCT CCCGCTGTCC GCCGGGTTCC CGGGCCCCTT GTTGCGGTCA TCGCGGCCAC CGCCCTCTCC CTGCCCTTCA ACGTGGACCG GATCACCTTT GACGGCTCCC TGCTGGGCGC ACTTGCCTTT CCCGAGCTGC CCGACGGGAA CTGGACCGCG GTTGTCCTGG GCGTGGTGAC CATCGCGCTG ATCGCCAGTG TGGAATCCCT GCTCTCGGCC GTGGCCGTCG ACAAGATGCA CCACGGGAGG CGAACGGACT TCAACCGCGA ACTCCTGGGG CAGGGCGCCG CCAATGTGAC CTCCGGGATG CTTGGAGGCC TGCCCGTGAC CGGGGTGATC GTCCGGAGCG CCACCAACCT TGAGGCCGGT GCCCGGACGC GCAAGTCCGC AATCCTCCAC GGTGTCTGGG TGCTGGTCTT CTCGCTGCTG CTGGCGGGCC TGATCCAGCT GATTCCGCAG GCTGTCCTCG CAGGACTGCT CATCGTGATC GGTTCGCGCC TGGTCCGGGC GGCCGACATC AGGACGGCAC GGCGGACGGG CGACCTGACC GTCTACGGTG TGACCCTGTT CTGCGTGGTG TTCGTCAATC TGCTCGTGGG AGTTTTGACC GGCCTGGTCC TGGCCGTTGC GCTGGTTCTC TGGCGTGTGG CGAGGGCCAG CATCCACGCT GAACCGGCAG GCACCGGCGA CGGGAAGCGC TGGCGGGTGG TGATCGACGG CTCATGCAGC TTCCTCTCCC TCCCGCGGCT GAGCGCCGTG CTGGCCTCGG TTCCGGCCGG GGCGCACGTC ACGGTGGAGC TGGAGGTGGA CTTCCTGGAC CATCCCGTCC ACGACACCCT TGACGCCTGG CGCAACAGGC ACGTGGGCAA CGGCGGCACG GTGGTCGTTG AGGAAAGCGG CACGGCCACG CTCCACGACG CACAGGCCGG CCCGCCGAGC CGCGGCAGTT CACGCTCCGC CCTTCGGAGC GGCTTCGCCC CCTGGCGCAG CTGGCAGCAG CGGCTCACCG GGCACGCGGC CTCCGGGACT GAGGCTGCGC GGCCGAACGT TGCGGGACCG CGGGTTCCAG GGCCCGATGT TCCCGGGCCG CTGCGGTCGG TGCTGGACGG TGTGGACAAC TACCACCGGC GGAACGCCCA TCTGGTGCGT CCCCATGTGC AGGAGCTGTC CTCGTACCAG 
GATCCGGGCA CGCTGTTCGT GGCCTGCTCG GACTCCCGCC TGGTGCCGAA CCTGATCACC AGCAGCGGCC CGGGTGATCT CTTCACCGTG CGGAACGTGG GCAATGTGGT GGGCGACGAC GGCCGGGACG CCTCCATCGA GGCAGCCCTG GAGTTTGCCC TCAACGAACT CTCGGTGGAG TCGATTGTGG TCTGCGGGCA TTCGGGCTGC GGTGCCATGA CCGCCCTGTG GGCGGACCCG GACGGCGCGG GCGATCGAGG CGCCATCGAC GTCTGGCTCG ACCACGCCCG GCCAAGCCTG ATGGCTTTCC GCGACGGGCA CCCCGTCCAG GCAGCTGCAG CCGAGGCGGG TTTCGGTGCC GTGGACCAGC TGGCCATGGT GAACGTTGCG GTCCAGCTGG ACAGGCTCCT GGGCCACCCG GGATTGCGGG AACCGCTGGA CTCAGGACGC GTCCACGTCG CAGGGCTGTT CTACGACATC TCCACGGCCC GCGTCCTGCA GATCACGCCC GACGGCATCG GCCACCTGGA CGCCTCCCGG GAGAGCCGCT AA
|
Protein sequence | MPSAPAKTDP PAPQRTAPSG HHPAEKDPGG LREFLTTGLR WDLPASLVVF LVAVPLSLGI AAASGAPVMA GLIAAAVGGI VAGSLGGSPL QVSGPAAGLT VIVAGLIEQF GWPVTCAITA AAGVLQALLG LARVGRVALA IAPVVVHAML AGIGITIVLQ QLHVMLGAES ASEAWENIMG MPGSFLAADI AAAVLGAVVI ALLLGWKHLP PAVRRVPGPL VAVIAATALS LPFNVDRITF DGSLLGALAF PELPDGNWTA VVLGVVTIAL IASVESLLSA VAVDKMHHGR RTDFNRELLG QGAANVTSGM LGGLPVTGVI VRSATNLEAG ARTRKSAILH GVWVLVFSLL LAGLIQLIPQ AVLAGLLIVI GSRLVRAADI RTARRTGDLT VYGVTLFCVV FVNLLVGVLT GLVLAVALVL WRVARASIHA EPAGTGDGKR WRVVIDGSCS FLSLPRLSAV LASVPAGAHV TVELEVDFLD HPVHDTLDAW RNRHVGNGGT VVVEESGTAT LHDAQAGPPS RGSSRSALRS GFAPWRSWQQ RLTGHAASGT EAARPNVAGP RVPGPDVPGP LRSVLDGVDN YHRRNAHLVR PHVQELSSYQ DPGTLFVACS DSRLVPNLIT SSGPGDLFTV RNVGNVVGDD GRDASIEAAL EFALNELSVE SIVVCGHSGC GAMTALWADP DGAGDRGAID VWLDHARPSL MAFRDGHPVQ AAAAEAGFGA VDQLAMVNVA VQLDRLLGHP GLREPLDSGR VHVAGLFYDI STARVLQITP DGIGHLDASR ESR
|
| |
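The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The sketch below is one possible way to do that in Python and to compute a GC fraction; the helper names are hypothetical, and the sample string holds only the first two blocks of the gene sequence, so its GC fraction is not expected to equal the 70% reported for the full gene.

```python
def normalize_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for display grouping and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, copied from the record.
sample = "ATGCCATCAG CACCCGCCAA"
seq = normalize_sequence(sample)
print(len(seq), seq.startswith("ATG"), gc_fraction(seq))
```

Passing the full Gene sequence field through `normalize_sequence` first would allow the same function to be compared against the GC content field.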