Gene Arth_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3333 
Symbol 
ID4444062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3745422 
End bp3747773 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content70% 
IMG OID639691156 
Productcarbonate dehydratase 
Protein accessionYP_832808 
Protein GI116671875 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0288] Carbonic anhydrase
[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCAG CACCCGCCAA GACGGACCCG CCCGCACCGC AACGGACCGC CCCATCAGGC 
CACCACCCCG CTGAAAAGGA TCCCGGGGGC CTTCGGGAAT TCCTGACCAC CGGGCTCCGC
TGGGACCTGC CGGCGTCGCT GGTGGTGTTC CTGGTGGCCG TCCCGCTGTC GCTGGGCATC
GCGGCCGCCT CCGGCGCCCC GGTGATGGCC GGCCTGATTG CCGCCGCCGT GGGCGGGATC
GTCGCCGGCA GCCTGGGTGG TTCCCCGCTG CAGGTCAGCG GGCCAGCCGC CGGACTGACA
GTCATCGTTG CCGGCCTGAT TGAACAGTTC GGCTGGCCGG TCACCTGCGC CATCACCGCG
GCAGCCGGCG TGCTCCAGGC ACTGCTCGGA CTGGCACGGG TGGGCCGCGT CGCGCTGGCC
ATCGCACCGG TGGTGGTCCA TGCCATGCTC GCGGGCATCG GCATCACCAT TGTGCTGCAG
CAGCTGCACG TGATGCTCGG CGCCGAGTCG GCCAGCGAGG CCTGGGAGAA CATCATGGGC
ATGCCGGGCA GCTTCCTTGC CGCCGACATC GCCGCTGCAG TCCTTGGTGC TGTAGTTATC
GCCCTGCTGC TCGGCTGGAA GCACCTTCCT CCCGCTGTCC GCCGGGTTCC CGGGCCCCTT
GTTGCGGTCA TCGCGGCCAC CGCCCTCTCC CTGCCCTTCA ACGTGGACCG GATCACCTTT
GACGGCTCCC TGCTGGGCGC ACTTGCCTTT CCCGAGCTGC CCGACGGGAA CTGGACCGCG
GTTGTCCTGG GCGTGGTGAC CATCGCGCTG ATCGCCAGTG TGGAATCCCT GCTCTCGGCC
GTGGCCGTCG ACAAGATGCA CCACGGGAGG CGAACGGACT TCAACCGCGA ACTCCTGGGG
CAGGGCGCCG CCAATGTGAC CTCCGGGATG CTTGGAGGCC TGCCCGTGAC CGGGGTGATC
GTCCGGAGCG CCACCAACCT TGAGGCCGGT GCCCGGACGC GCAAGTCCGC AATCCTCCAC
GGTGTCTGGG TGCTGGTCTT CTCGCTGCTG CTGGCGGGCC TGATCCAGCT GATTCCGCAG
GCTGTCCTCG CAGGACTGCT CATCGTGATC GGTTCGCGCC TGGTCCGGGC GGCCGACATC
AGGACGGCAC GGCGGACGGG CGACCTGACC GTCTACGGTG TGACCCTGTT CTGCGTGGTG
TTCGTCAATC TGCTCGTGGG AGTTTTGACC GGCCTGGTCC TGGCCGTTGC GCTGGTTCTC
TGGCGTGTGG CGAGGGCCAG CATCCACGCT GAACCGGCAG GCACCGGCGA CGGGAAGCGC
TGGCGGGTGG TGATCGACGG CTCATGCAGC TTCCTCTCCC TCCCGCGGCT GAGCGCCGTG
CTGGCCTCGG TTCCGGCCGG GGCGCACGTC ACGGTGGAGC TGGAGGTGGA CTTCCTGGAC
CATCCCGTCC ACGACACCCT TGACGCCTGG CGCAACAGGC ACGTGGGCAA CGGCGGCACG
GTGGTCGTTG AGGAAAGCGG CACGGCCACG CTCCACGACG CACAGGCCGG CCCGCCGAGC
CGCGGCAGTT CACGCTCCGC CCTTCGGAGC GGCTTCGCCC CCTGGCGCAG CTGGCAGCAG
CGGCTCACCG GGCACGCGGC CTCCGGGACT GAGGCTGCGC GGCCGAACGT TGCGGGACCG
CGGGTTCCAG GGCCCGATGT TCCCGGGCCG CTGCGGTCGG TGCTGGACGG TGTGGACAAC
TACCACCGGC GGAACGCCCA TCTGGTGCGT CCCCATGTGC AGGAGCTGTC CTCGTACCAG
GATCCGGGCA CGCTGTTCGT GGCCTGCTCG GACTCCCGCC TGGTGCCGAA CCTGATCACC
AGCAGCGGCC CGGGTGATCT CTTCACCGTG CGGAACGTGG GCAATGTGGT GGGCGACGAC
GGCCGGGACG CCTCCATCGA GGCAGCCCTG GAGTTTGCCC TCAACGAACT CTCGGTGGAG
TCGATTGTGG TCTGCGGGCA TTCGGGCTGC GGTGCCATGA CCGCCCTGTG GGCGGACCCG
GACGGCGCGG GCGATCGAGG CGCCATCGAC GTCTGGCTCG ACCACGCCCG GCCAAGCCTG
ATGGCTTTCC GCGACGGGCA CCCCGTCCAG GCAGCTGCAG CCGAGGCGGG TTTCGGTGCC
GTGGACCAGC TGGCCATGGT GAACGTTGCG GTCCAGCTGG ACAGGCTCCT GGGCCACCCG
GGATTGCGGG AACCGCTGGA CTCAGGACGC GTCCACGTCG CAGGGCTGTT CTACGACATC
TCCACGGCCC GCGTCCTGCA GATCACGCCC GACGGCATCG GCCACCTGGA CGCCTCCCGG
GAGAGCCGCT AA
 
Protein sequence
MPSAPAKTDP PAPQRTAPSG HHPAEKDPGG LREFLTTGLR WDLPASLVVF LVAVPLSLGI 
AAASGAPVMA GLIAAAVGGI VAGSLGGSPL QVSGPAAGLT VIVAGLIEQF GWPVTCAITA
AAGVLQALLG LARVGRVALA IAPVVVHAML AGIGITIVLQ QLHVMLGAES ASEAWENIMG
MPGSFLAADI AAAVLGAVVI ALLLGWKHLP PAVRRVPGPL VAVIAATALS LPFNVDRITF
DGSLLGALAF PELPDGNWTA VVLGVVTIAL IASVESLLSA VAVDKMHHGR RTDFNRELLG
QGAANVTSGM LGGLPVTGVI VRSATNLEAG ARTRKSAILH GVWVLVFSLL LAGLIQLIPQ
AVLAGLLIVI GSRLVRAADI RTARRTGDLT VYGVTLFCVV FVNLLVGVLT GLVLAVALVL
WRVARASIHA EPAGTGDGKR WRVVIDGSCS FLSLPRLSAV LASVPAGAHV TVELEVDFLD
HPVHDTLDAW RNRHVGNGGT VVVEESGTAT LHDAQAGPPS RGSSRSALRS GFAPWRSWQQ
RLTGHAASGT EAARPNVAGP RVPGPDVPGP LRSVLDGVDN YHRRNAHLVR PHVQELSSYQ
DPGTLFVACS DSRLVPNLIT SSGPGDLFTV RNVGNVVGDD GRDASIEAAL EFALNELSVE
SIVVCGHSGC GAMTALWADP DGAGDRGAID VWLDHARPSL MAFRDGHPVQ AAAAEAGFGA
VDQLAMVNVA VQLDRLLGHP GLREPLDSGR VHVAGLFYDI STARVLQITP DGIGHLDASR
ESR