Gene Ava_C0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0233 
Symbol 
ID3678032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp274301 
End bp277003 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content41% 
IMG OID637715313 
ProductType III restriction enzyme, res subunit 
Protein accessionYP_320507 
Protein GI75812890 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.179652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00064314 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGATATCG AATTCAAGCA AGGCAATAGT GTATCTCATC CCAAACACGG CAATGGTCAG 
GTAAGAACTG ATGAAGGAAC TACGGTTATT GTCCGTTTTG AGCATGGTTT GGAAGAATGT
CCTAGAGAAG AACTCACTCG TCTCTCTTCA CTGGAAGAAG CTATCAATTC CCTAAAGTCG
CATAATTCGC TTGAGGTGAT TGCTCGCGTG CAAGCATTGG CAATACGCTC GGTTAATGAT
GTTTGGGGTA TATTTTCTCT GGCACGGATT GCTCTACTAC CCCATCAACT ATGGGTCTGT
CGTCGGGTTG TTCAGGAACT ACCAGCCCGT TGCTTGGTTG CTGATGATGT TGGGCTGGGG
AAAACTATTG AGGCAGGTCT TATTCTTTGG ACACTTTTGA GTAAGGGAGC AGTGAAGCGT
ATCTTAATTG TCTGCCCTGC TTCTCTTGTT GAGCAGTGGC AACACCGACT GCGGGTGATG
TTTGATATTC GCTGTACACG ATACATAACT GAAGCAGATA CATTGAAAAG CGATTTCTGG
AACACGCATC ATCAGGTAAT TGCTTCATTG CCTACACTCC GCAAGAATAG CGGCGATCGC
CATAAACGTC TATTTGATGC AGAACCTTGG GACTTACTAA TTGTTGATGA GGCACACCAT
CTTAATGCTG ATGAACAAAG TGGGCCGACC CTTGGATATA GTTTTATAGA TAAGTTGGTT
AACAGGTATA AAAAAGTCCG TTCAGTTGTG TTTTTTTCGG GAACACCGCA TCGCGGCAAA
AACTATGAGT TTTTCTCTCT ACTTAAGCTT TTACGAGAAG ACCTGTTCCA TCCCAAAAAA
CTTTTGAAGG AGCAATTGCA ATCGCTGCGA AGTGTAATGA TTCGCAACAA CAAGCAGTCT
GTTACAGATA TGAAGGGGAA TAAATTATTT CTACCTCTAC AGGTGCGATC GGAAATTTAC
GATTATTCGC CTGAAGAGTC ACTGTTCTAC GAACAGCTAA CTGAGTTTAT CCTGACAGGA
AAAGCTTATG CTTCTGGACT TGATGATTTT AACCAACGGG CAGTCATGTT GATTTTGATT
TGTATGCAAA AGCTTGCTTC TAGCTCAGTA GCAGCAATTC GCAGAGCATT AGAAGGTCGC
TTAAGACGAA TTGATAACAA CCGGAAGCAA CTCGATAAGG TGAAACAACG AAAACAGGAG
TTAGAGCAAG AATTCAATGA CCTTGAGGAC GAGGAAAAAC TCAATAGTGA CGAAATTAAT
CGTTTGGAAG AACGGATTCT GGAATTAACA GAAATGATCC GACTCATGGA AGATGAGGAA
CCTCGTCTGA GAGAACTTGT ACAAGCTGCC AGTGCCATCA GGGAAGAAAC AAAAATTCAG
AAAATTCTGG ATGTGGTTGC CAATCAGTTT ACTGACCGCC AAGTGCTTTT TTTCACAGAG
TATAAAGCAA CGCAATCATT ACTAATGTCT GCGTTGGTTG CTCAATATGG CGATCGGTGC
GTTACTTTTA TTAATGGTGA CGAACTTGCA GAGGGAGTTA TTAGCGCATC TAATCAATTT
ATAAGTCTGC GAGAGAAACG CGAAAATGCT GCCGAAAAAT TTAACAAGGG AGAGGTCAGA
TTTCTTATTT CCACTGAGGC TGGTGGTGAA GGTATCGACT TGCAAGAAAA TTGTTATTCT
CTGATTCATG TAGACTTGCC TTGGAATCCT ATGCGTCTAC ACCAACGAGT TGGCCGCTTG
AATCGTTACG GGCAAAAACG AGCAGTAGAA GTAATTACAC TACGTAACCC CCAGACTGTA
GAAACCCTAA TTTGGGACAA ACTCAATCGC AAGATTGATA ACATCATGCA ATCTCTACAG
CAAGTAATGG ATGAACCAGA AGATTTGCTA CAACTTGTTC TGGGGATGAC ATCACCAAAG
TTATTTCGAG AAATTTTTAG CGAAGCTGAC AGACATCATG AAGATTTAAA TCAATGGTTC
GATAGCAAAA CTGCTAAATT TGGTGGAAAA GATGCGATCG CTACTGTACA GGAAATTGTT
GGGAGCTGCG ATAAATTTGA CTTTCAACAA GTTTCTCCTT TACTACCACA ACTTGATTTA
CCCGATTTAC AATCATTTTT TTTGACAATG TTGCAACTGA ATAAGCGACG AATAAAATTT
ACAGATGAAG GCTTCTCATT TATAACACCT GATACTTGGC GTACTGAATC AGCAATTCAA
AGAGAGTACG AAGAGGTACA CTTTAATCGT TATCGAGGTG ATAAAAATCC TGCACATCAT
TTGCTTGGCG TTGGTCACAA ACTGTTTAAT GCAGCTTTGT CAGAAGCTAC GAGTTTTACT
GCTTGTGTAA CTGTCTTACC TTGTATAAAA CAACCTTTAT TTATCTTTTT GATTATTGAC
CGCGTGACTG GTATCCAGTC TAATGTACGG CAAGCTGTTG TAGGGGTAAG CATAGATTTA
TTCAAGCAAG CAACTATTGT GAAAGATTGG GAACTAATAG CCACATTAAA TCAATACTTA
CCAGAATTAA AGAAAATGTC TGAATCTTCT AATGCTTGTC AACTAAACGC GAATGAAATT
AGTCGATTGC TAGCCGATGC AGAACAGTTT TTAGAAAATA ATTTGAATGG CTTAAATCTT
CCTTTCAAAG TGCCAGCTAT TCAAGCAATC AGCGTAATTC TTCCAGCCAA GGAAGCAATT
TAA
 
Protein sequence
MDIEFKQGNS VSHPKHGNGQ VRTDEGTTVI VRFEHGLEEC PREELTRLSS LEEAINSLKS 
HNSLEVIARV QALAIRSVND VWGIFSLARI ALLPHQLWVC RRVVQELPAR CLVADDVGLG
KTIEAGLILW TLLSKGAVKR ILIVCPASLV EQWQHRLRVM FDIRCTRYIT EADTLKSDFW
NTHHQVIASL PTLRKNSGDR HKRLFDAEPW DLLIVDEAHH LNADEQSGPT LGYSFIDKLV
NRYKKVRSVV FFSGTPHRGK NYEFFSLLKL LREDLFHPKK LLKEQLQSLR SVMIRNNKQS
VTDMKGNKLF LPLQVRSEIY DYSPEESLFY EQLTEFILTG KAYASGLDDF NQRAVMLILI
CMQKLASSSV AAIRRALEGR LRRIDNNRKQ LDKVKQRKQE LEQEFNDLED EEKLNSDEIN
RLEERILELT EMIRLMEDEE PRLRELVQAA SAIREETKIQ KILDVVANQF TDRQVLFFTE
YKATQSLLMS ALVAQYGDRC VTFINGDELA EGVISASNQF ISLREKRENA AEKFNKGEVR
FLISTEAGGE GIDLQENCYS LIHVDLPWNP MRLHQRVGRL NRYGQKRAVE VITLRNPQTV
ETLIWDKLNR KIDNIMQSLQ QVMDEPEDLL QLVLGMTSPK LFREIFSEAD RHHEDLNQWF
DSKTAKFGGK DAIATVQEIV GSCDKFDFQQ VSPLLPQLDL PDLQSFFLTM LQLNKRRIKF
TDEGFSFITP DTWRTESAIQ REYEEVHFNR YRGDKNPAHH LLGVGHKLFN AALSEATSFT
ACVTVLPCIK QPLFIFLIID RVTGIQSNVR QAVVGVSIDL FKQATIVKDW ELIATLNQYL
PELKKMSESS NACQLNANEI SRLLADAEQF LENNLNGLNL PFKVPAIQAI SVILPAKEAI