Gene P9303_14471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_14471 
SymbolvacB 
ID4776505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1244201 
End bp1246591 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content49% 
IMG OID640086956 
Productputative acetazolamide conferring resistance protein Zam 
Protein accessionYP_001017458 
Protein GI124023151 
COG category[K] Transcription 
COG ID[COG0557] Exoribonuclease R 
TIGRFAM ID[TIGR00358] VacB and RNase II family 3'-5' exoribonucleases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCA CGGTCGCTGA CCTGCTTGAC CAGTTACCCC CCACTGGGGG ACTGGAGATC 
AAAAAACTCG AAAAAATTCT CAAGCTCACC ACAAAGGCCG ATCGCGACGG TCTTGAAACA
GCTCTTCAGG GTCTCTTGAG ACTCGGCATT GTCAACAACG AAGAAGCCGG TGCAATAAAG
CGCAGTGATG ACGAGTCACT GATTGAAGCT CGTTTGCGCT GCAGCAGTAA AGGATTTTGT
TTTGCCCTAA GAGATGATGG GGGAGACGAT ATCTATATTC GCGATCATCA GCTCAACCAT
GCCTGGAACG GAGATCGGGT ACTAGTCCGC ATCACAAGAG ATGGGGGACG TCGAAGATCG
CCAGAAGGTG GGGTGCAATG CATCCTCGAA AGGACCACCA CCAGCTTGCT TGGTCATGTT
GAACGCAAAG ATCAAAATGT CCTTGCAATA CCTCTTGATG ACCGCATCCT TGCAACGATC
CAACTACCAG ATGGTGATAA AGCCCATCTG AATGATGGGG AGCAGACAAG CGTGGTGGAG
GTGAAGCTTG ACCGCTATCC CATTGCCCAG TTCCCCGCTG AAGGGCATGT AGCGCGGTCA
CTTCCCTTGA ATGGAGGCCC CTCAAGTGAT CGTGATCTTC TTCTTACCAA AGCCAACCTT
CAAGATCGAC CTGCTCCACC ACGCAGCAGC CTAAAAACCC CAACGGCTAA ACATCGGCAA
GATCTGAGCT CACAACCAGC GTTGCTCCTG CGCAGCTGGC AGGTATCTGA TGCTCCACCC
CTACCAGCGG TCTATGTGGA GCCCCATGCA GGTGGCACAC GTCTGTGGAT TCATGCGCCT
GCTGTAGCAG AACGTCTGAG TATTGGGAAC AACCTGGATC TCTGGCTAAG AGATCGAGCT
GAAGCTCTCT GCCTTGGCGA AGTTTGGCAT CCCCTACTCG GGCAAATTCT TTCAAAGGCC
TGTTCATTTA AAGTTGGGGA AATAAACGAT GCAGTAAGCG TTGCTCTCGA CATCTCAGCA
GATGGTGAGG TGATTGATTG GCATTTCAGT CTTAGCTCAA TCAAGCCAGT AGCCGAAATC
GGACCGAAGA CCCTCACAGC TCTTAGCAAC AGGAAACCAA AGGCAAGAAC AGTCCCTGCT
GCTCTGAAGT CCGTCAAGGA TCACTTAGCC CAACTCGAGA CATTGATCTT TTGTGCTCGT
ACATTGCAAA TGGGAGAACA GGCCAGTGGC TCTATAGAGC TGGATCTGGC TGTTCCAGAA
CTTCAATGTC TTGGCGACCT CCGCTGGGCA GACCCTGATT CATATCGTCA TCAATGGACG
CTTCCTCTTG ATCAAACTGA CCCCCAATCA GTTTTGAGCC CAATAATTAG AGCTGCTCAT
CGCGCCTGGG CACAACATGC CCAGAACTTG CAACTCCCTG GCCTGGTGAT TGAGGCTTCA
GAAGCAGAAA ACAGCACCTT GAACGACGTT GCTAAATCAG CATTAGCTCT TGATATTGCC
TTGGAACTTG ATGAAGAAGG TAGTCCTGCT GCCACAGAGC TTGCAAAAGC ATTTGCGACA
ACGGCCTGTC GAAGAGTGCT TGATCAGCAA CTACGTCATG CACTGCCAGA ACCTGTGTTA
CGGCTCGCAA TTCATGATAG TCAAAAAGAG AATGGAAACA TTGAATCCGA CCATGAAGAC
ACAAAGTCAA TGGATACGAA CCTACAATCA CCTTGGTGCT GTCCAACTAT CCATTACACC
GATCTTGTTA ATCAAGAGAT CATAGTGTCA CTTTTAAGTA ACGGTAAAGA TCGTCCAAAT
GTCCGACAAA AAGAAAAATT GGTTCTGGGG TCCCGTGAAT GCTGGCAACA ACTCAAATGG
CCACTATTCA GCACAAGCCA AGCAAAGAAC CTAAAAGAAA TCTGTAGTGA AAGCCTCGTG
CATCGCCTCA ACACACTGCG CCGTCAGGCT GAAGAGCTTC GACAAGACCT GATCGCCATG
GTTCAGGCAC GTATCGTGGA ACCCCTTGTT GGGGAAGAGC ATCAGGGGGT AATAAGCGGG
GTGCAGAGTT ATGGGTTCTT TGTGGAGATT CCACCTTCTA TGGCAGAGGG TCTTGTCCAC
GTCAGCTCTT TAAATGACGA TTGGTACGAA TATCGCTCAC GTCAAAACCG ATTGGTGGGT
CGGAAGAATC GCAAGATCTA TCAACTTGGT GATCAAGTAA ACGTGAAGGT TCTCAAAGTT
GACGCCTTAA GGAACCAAAT TGACCTCGAA GTTAATGGAT CAGCAATGAC TGTTGAAGCC
AATATGAATG TTGATTCAGA TCTAACCAAT CAGGTAACAA CAACAAAAAG TGAGGTCAAA
GGCAAAATCA CCAAGAATAA TGAACAACTA GTCAGCTCAA GTGAGGCATA G
 
Protein sequence
MKFTVADLLD QLPPTGGLEI KKLEKILKLT TKADRDGLET ALQGLLRLGI VNNEEAGAIK 
RSDDESLIEA RLRCSSKGFC FALRDDGGDD IYIRDHQLNH AWNGDRVLVR ITRDGGRRRS
PEGGVQCILE RTTTSLLGHV ERKDQNVLAI PLDDRILATI QLPDGDKAHL NDGEQTSVVE
VKLDRYPIAQ FPAEGHVARS LPLNGGPSSD RDLLLTKANL QDRPAPPRSS LKTPTAKHRQ
DLSSQPALLL RSWQVSDAPP LPAVYVEPHA GGTRLWIHAP AVAERLSIGN NLDLWLRDRA
EALCLGEVWH PLLGQILSKA CSFKVGEIND AVSVALDISA DGEVIDWHFS LSSIKPVAEI
GPKTLTALSN RKPKARTVPA ALKSVKDHLA QLETLIFCAR TLQMGEQASG SIELDLAVPE
LQCLGDLRWA DPDSYRHQWT LPLDQTDPQS VLSPIIRAAH RAWAQHAQNL QLPGLVIEAS
EAENSTLNDV AKSALALDIA LELDEEGSPA ATELAKAFAT TACRRVLDQQ LRHALPEPVL
RLAIHDSQKE NGNIESDHED TKSMDTNLQS PWCCPTIHYT DLVNQEIIVS LLSNGKDRPN
VRQKEKLVLG SRECWQQLKW PLFSTSQAKN LKEICSESLV HRLNTLRRQA EELRQDLIAM
VQARIVEPLV GEEHQGVISG VQSYGFFVEI PPSMAEGLVH VSSLNDDWYE YRSRQNRLVG
RKNRKIYQLG DQVNVKVLKV DALRNQIDLE VNGSAMTVEA NMNVDSDLTN QVTTTKSEVK
GKITKNNEQL VSSSEA