Gene A9601_10221 details


Gene Information

Locus tag: A9601_10221
Symbol: vacB
ID: 4717733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 881063
End bp: 883285
Gene Length: 2223 bp
Protein Length: 740 aa
Translation table: 11
GC content: 27%
IMG OID: 640078737
Product: putative acetazolamide conferring resistance protein Zam
Protein accession: YP_001009413
Protein GI: 123968555
COG category: [K] Transcription
COG ID: [COG0557] Exoribonuclease R
TIGRFAM ID: [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
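
The start, end, gene length and protein length listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein length; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(881063, 883285)
print(length, protein_length_aa(length))  # prints: 2223 740
```

Both values agree with the Gene Length and Protein Length fields of this record.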


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.222405
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCACAT CATCTTCAAT AATTGATAAT CTTAATCAGT CAGAAGGGTT AGAATATAAA 
AAATTATGCA GATTATTAAA AATAACAAAG AAATCTGATA AGGATAAATT AGATATTGCT
TTAACAGCTC TAGAAAAACT TGAGATAATT AATAAAAATG AAAATGATGA ATATACTTGC
ATAAAGGATA ATGATCATCT TGTCGCCAAA ATAAGATGTA GTAGCAAAGG CTATTGCTTT
GCTGTAAGAG GAAAAGACAA GGAAGACATC TACATTAAAG AAAATCTACT TAACTATGCA
TGGAATGGAG ATAAAGTTTT AGTAAGGATA ATAAAAGAGG GATATAGAAG AAGATCACCT
GAAGGAATAG TTGATTGTAT TCTTGAAAGA TCAAATCAAA TACTTCTTTC TAAAGTAGAA
ATAATAAACA ATGATGTATA TGCAATCCCA ATAGACGATA GGATCCTTTC TAAAATTAAA
CTTCCAAAAG AAAATAAAAA ATACATTTTC AATCCAGACA ATAAGAATAT AGTAAAAGTT
GAGATTGATA GATTCCCAAT AGGTCAAGAA GAAGGTCTAG GTCATGTGAT ACAAGAACTA
AAACTAAATA ATAATGAAGA CTATGATACG GACTTTGTTT TATCTAAAAG CAATATCGTC
AAATCATACG ATTTAAATCA TATTGAATCA AAAAAAATAG AACAAAGGGA GAGAATAGAC
CTTTCAGATA AAAACTCTTA TTTATTCAAA AGTTGGAATT CTAATAATTC TCCAATGCCT
CCAATGATTC AAATAGAGCA GGGGAAAAAT AAAAATACTA AATTATGGAT ACATACAAAT
AATCTTGCAG AAAGAGTAGA TCTAAATAGT AAAAAATCTC TAGAAATATT ATTCAAAGGC
TTTGAATCAT TACCCTTATT AAATGATTGG CAAAACTACC TTGGTGAAGC CATAAGAAAT
GATTCTGAAT TTAAAATTGG TGAAAAAAAT GAAGCAATAA GCCTCTGTAT CAATTTAAAT
AGTGATAATG AAATAATTGA TTGGACATTT CATCTTACTT TAGTAAGATG CACTCTTATT
GTTGGAAGTG CTCATACTGA CGCGCTTCTA TCTAGAAAAA GCAAATCAAG AATAACATCT
CGGGTATTAA AACCTATAAA GGAATATGTC GACGATTTAG ATAAAATACT AGAAATTTCA
TGTTCATTTA GAGAAAAACA TCTTTTGGAG GGTAAGGTGG AAATTCCTGC GCCACTGAAT
AAGATTGAAG CACTAGAAGA ATTTTTTATT CACAATCCTG CTGAATATTC AAAAGGATAT
TTTGAATCAT TAAATAAAGA AGATTGCCAA ACTTACCTTT CACCAATACT TTATGAAGCT
AATTTAATAT GGTTCAAACA TTCAAATCAA TATGGATTAA AAAGTGCAGG ATACATCTCA
AATGGAATAG ATTACGTTAA TGCTAATGAA ATTATCAAAT ATTCAGAATT TATTGATAAT
GATTTAGAGC TTAATGAAGA TGGCAATTTG ACATTTAGCC AAGTAATTAA ATTATGTGAC
GACGAAAATA AAAAAAGAAT CTTACATAAA CTTCTAATTA ATGAATTTAA GGACAATGAA
ATAAGGTTGA TATCTAAAGA TGCTGATAAT GATGAATCAG AAAAATTATT TATTTCTCCA
TGGACAATTC CTGGATATGA CTTTTCAAAT CTTATAAATC AGTACTGTAT TTTTAATATG
ATAATAAATG GTAAGAAATC AAAGAAAAAT AATATAAATG AAATTAATAT ATCTGATAGT
AATTCAATAG AATTAGTAAA ATGGGATATA TTTAATTCAT CAATTTCAAA AAATCTAGAA
ATATTATTTA ACAAGTTTGT GATAGATAAA CTTAATGAAT TTAAGTACAA AGTTAACCAA
TATAAATCTA ATATGATAAA TATAAAAAAA GTAAGAAAAG CAGAAAAATT ACTAGGTAAT
ATTTATAGTG GGTTTATTTT ATCAGTGCAA ACATATGGTT TCTTTGTTGA GATATCAGAA
CTAAATGTAG AGGGTTTAGT ACACGTAAGC ACACTTAATA ATGATTGGTA TGAATACAGG
TCAAGACAAA ATCTATTGAT TGGAAGAAAA TCCAAAAAAT CATATAAAGT TGGTGATGCA
ATAGAAGTAA AAATTATAAA AGTCGATATT CTTAAATATC AAATTGATTT AGAATTAACA
TAA
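
A GC content figure such as the one in the Gene Information table is conventionally the share of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, assuming the plain (G + C) / (A + C + G + T) definition; the database may compute or round its value differently, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq.

    Characters other than A, C, G and T (such as the spaces between
    10-base blocks) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied with its block spacing intact.
first_row = "ATGTTCACAT CATCTTCAAT AATTGATAAT CTTAATCAGT CAGAAGGGTT AGAATATAAA"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of a single row gives the gene-wide percentage.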
 
Protein sequence
MFTSSSIIDN LNQSEGLEYK KLCRLLKITK KSDKDKLDIA LTALEKLEII NKNENDEYTC 
IKDNDHLVAK IRCSSKGYCF AVRGKDKEDI YIKENLLNYA WNGDKVLVRI IKEGYRRRSP
EGIVDCILER SNQILLSKVE IINNDVYAIP IDDRILSKIK LPKENKKYIF NPDNKNIVKV
EIDRFPIGQE EGLGHVIQEL KLNNNEDYDT DFVLSKSNIV KSYDLNHIES KKIEQRERID
LSDKNSYLFK SWNSNNSPMP PMIQIEQGKN KNTKLWIHTN NLAERVDLNS KKSLEILFKG
FESLPLLNDW QNYLGEAIRN DSEFKIGEKN EAISLCINLN SDNEIIDWTF HLTLVRCTLI
VGSAHTDALL SRKSKSRITS RVLKPIKEYV DDLDKILEIS CSFREKHLLE GKVEIPAPLN
KIEALEEFFI HNPAEYSKGY FESLNKEDCQ TYLSPILYEA NLIWFKHSNQ YGLKSAGYIS
NGIDYVNANE IIKYSEFIDN DLELNEDGNL TFSQVIKLCD DENKKRILHK LLINEFKDNE
IRLISKDADN DESEKLFISP WTIPGYDFSN LINQYCIFNM IINGKKSKKN NINEINISDS
NSIELVKWDI FNSSISKNLE ILFNKFVIDK LNEFKYKVNQ YKSNMINIKK VRKAEKLLGN
IYSGFILSVQ TYGFFVEISE LNVEGLVHVS TLNNDWYEYR SRQNLLIGRK SKKSYKVGDA
IEVKIIKVDI LKYQIDLELT
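
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the usual T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace between blocks is removed first; codons containing
    unexpected characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene and the final 63 bases (last full row plus TAA).
print(translate("ATGTTCACAT CATCTTCAAT AATTGATAAT"))  # prints: MFTSSSIIDN
print(translate("ATAGAAGTAA AAATTATAAA AGTCGATATT CTTAAATATC "
                "AAATTGATTT AGAATTAACA TAA"))  # prints: IEVKIIKVDILKYQIDLELT
```

Both outputs match the first block and the last line of the protein sequence shown above.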