Gene Ava_1156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1156 
Symbol 
ID3683351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1412992 
End bp1415916 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content41% 
IMG OID637716492 
ProductType I site-specific deoxyribonuclease HsdR 
Protein accessionYP_321675 
Protein GI75907379 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.159677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAC CAGGGGAACA GAAAACTGTT CAAAGCAGGA TAATTACTTA CGCTCAAGAA 
ATTGGCTGGA CATTTGTATC TCGTGAAGAA GCGGAAACCA GACGGGGTTT TAATAATCAC
AGTGGTAACA TTCAAGAGAA GGCACAACAA GCATCCCTGT TTTTTGACGA TACCCTCTAT
CAAAAAGTCC GCCAGTTTAA CCCCAAGTAT ACCGAAGCTC AAGGCGTACT TATTGCGCTA
CTGAGTAACC TACCTGCCGA CATCTACGGA AATAGAGACT TCCTAACTTA TCTCCGCAAC
CAAGGCAAAT ATTTTTATGC CCCTGAAAAT CGGGAACTCG ATCTAAAACT CATCAACTAC
GATGACACCA CCGAAAACTC CTACGAAGTA ACGGAAGAAT ACTACATTCA TAACGGCAAG
TACGGTACAC GGGAAGATGT AGTCTTTTTA ATCAACGGTA TTCCTATCCT CGTCATCGAA
TGCAAAAACG CTACTAAAGA TGAAGCGATC GCATTAGGTG TTGATCAAAT TCGTCGCTAC
CACACCGAAA CACCAGAGTT ATTCGTTCCC CAGATGATTT TTACCGCCAC TGAAGCCATT
GGCTTTTCGT ATGGGGTGAC GTGGAATATT GTCAGGCGCA ACATTTTTAA CTGGAAAGCA
GAACAAATCG GTCAACTAGA AGCAAAAATT AAGAGTTTTT GTCAACCTAA CTATATTCTC
CAATTCCTGC AAAATTATAT TCTCTTTGCA GAGAAAGAAG AAACTCTGCA AAAATTTATT
CTGCGTCAAC ACCAAACCAC CGCAGTTGAA AAGGTAATTG AACGCTGTCA CGATCCTGAA
CACAGCAAAG GACTTGTCTG GCATACCCAA GGCAGTGGTA AGACTTTCAC AATGATTAAG
ACGGCGGAAA TGCTCTTTAA AGCCCCACAG AGCGATAAAC CGACAATTCT GCTGATGATT
GACCGCAACG AGCTAGAAGA CCAAATGCTC AGAAACCTAA TTAACTTAGG ACTGAATAAC
GTTCAACACG CTGATCGCAT TACCACCCTT AATCAACTAC TAAAAGATGA TTATCGTGGC
ATTATCGTCA CCATGATCCA CAAATTCCGC GATATGCCTG CCAATATTAA TCTAAGAAAA
AATATTTATA TTCTGATTGA TGAGGCACAC CGCACCACCG GAGGAGATTT AGGAACTTTC
TTAATGGCAG GACTCCCTAA TGCCACTTTT ATCGGATTTA CTGGTACACC TGTAGATAAA
ACTAGCTACG GTAAAGGCAC ATTTAAAACC TTCGGCACAG ACGATAAAAA GGGGTATCTG
CATAAATATT CCATTGCAGA AAGTATTGAA GACGGAACAA CACTACCCCT TTACTACAAT
ATTGCTCCCA GCAAGATAAT TGTGCCAAAG GAACTCATGG AAAAGGAGTT TCTCAACTTG
GCAGAAACTG AGGGGATTAG CGATATTGCC GAATTAAATA AAATTTTAGA TCGTGCCGTT
AATTTAAAAA ACTTCCTCAA AGGTAATCAA AGAGTAGATC AGGTAGCAAA ATATGTTGCC
AAGCATTACA CTCAAAACGT AGAACCACTA GGTTATAAAG CTTTTTTAGT CGCTGTTGAC
CGTCCTGCCT GTGCTAAATA TAAACAAGCT TTAGATAAAT ATTTACCTCC TGAATACTCC
GCAGTTGTCT ATACCGGAAA CAATAACGAT ACTCAAGAAC TCAAAACCTA CCACCTCGAC
GACAAGACAG AAAAACAAAT CCGTAAAAAC TTCGCCAAAT TTGGAGAATA CCCCAAAATC
CTGATTGTTA CCGAGAAACT GTTGACAGGA TTTGATGCTC CCCTGCTTTA TGCCATGTAT
CTCGATAAGC CCATGCGAGA TCATACCTTG TTACAGGCGA TCGCTCGTGT GAATCGTCCC
TACGAAAGCG AAGCTCAAGA GATGGTGAAA CCGCATGGTT TTGTGCTGGA TTTTGTAGGC
ATTTTCAACA AACTAGAAAA AGCACTTGCC TTTGACAGCG ATGAAATCAA CGCTATTATC
AAAGATTTGA AACTGCTCAA AACTTTATTC AAAAATAAAA TGGAGCAGCA AGCCTCAACT
TATTTGAGCT TGATTCAGAA CAATTTTAAC GATAAAGATG TAGATGATAT TATTGAGCAT
TTTCGAGAAC AAGAACGCAG AAAAGCATTT TCTAAAGCCT ATAAAGAACT AGAAATGCTC
TATGAAGTTA TTTCCCCTGA TGCTTTCCTA CGTCCCTACA TGGACGATTA CGCCACCCTT
TCCTCTATTT ATGAAGTAAT CCGCAAAGCT TACAGCAAAA GGGTTTACGT CGATAAAGCG
TTGCAGCGCA AAACTGATGA ACTCGTACAA AAACATATTG GCACAACAGC GATCGCAACT
GTAACAGATT TTGTAGAAAT CAATGCTCAA ACCCTCGAAG TCATCAAAGA CAAGCAAGGA
GGTGAAACCA CCAAAGTCAT CAACCTCATC AAAAGCATCG AAAAAACTGC CGAAGAAAAT
CCCGATGACC CTTTTCTCAT TGCAATGGCG GAACGTGCCA AAATTGTCCA AGAAAAGTTT
GAAAATCGCC AGTCCGATAC CCAAGAAGCC TTAGATACGT TGGTTAAAGA AATTGGTGAA
AATGAGCAGC GCAAGAAAGA ACAGGCTAAA CGTGGTTTTG ACAGCTTAAC GTTTTTTGTC
TTTCAAACCC TACAGGATGC AGGTATTGAT AATCCTGAAG CAGTAAGCAA TCAAATTAAA
CAAGCTTTTA TAGAGTATCC CAATTGGCGC ATCAGTGAGG CAGAATTGAG AGAGTTGCGG
AAAGAAGTCA CCTTCGCCAT TTTTGCTGAG ATCGATGAAT TGGATCAAGT GACAGCAGTT
GTTGATAAAC TTTTTATGTT GCTAAGTCAA GCTTACTCAA GCTAA
 
Protein sequence
MPKPGEQKTV QSRIITYAQE IGWTFVSREE AETRRGFNNH SGNIQEKAQQ ASLFFDDTLY 
QKVRQFNPKY TEAQGVLIAL LSNLPADIYG NRDFLTYLRN QGKYFYAPEN RELDLKLINY
DDTTENSYEV TEEYYIHNGK YGTREDVVFL INGIPILVIE CKNATKDEAI ALGVDQIRRY
HTETPELFVP QMIFTATEAI GFSYGVTWNI VRRNIFNWKA EQIGQLEAKI KSFCQPNYIL
QFLQNYILFA EKEETLQKFI LRQHQTTAVE KVIERCHDPE HSKGLVWHTQ GSGKTFTMIK
TAEMLFKAPQ SDKPTILLMI DRNELEDQML RNLINLGLNN VQHADRITTL NQLLKDDYRG
IIVTMIHKFR DMPANINLRK NIYILIDEAH RTTGGDLGTF LMAGLPNATF IGFTGTPVDK
TSYGKGTFKT FGTDDKKGYL HKYSIAESIE DGTTLPLYYN IAPSKIIVPK ELMEKEFLNL
AETEGISDIA ELNKILDRAV NLKNFLKGNQ RVDQVAKYVA KHYTQNVEPL GYKAFLVAVD
RPACAKYKQA LDKYLPPEYS AVVYTGNNND TQELKTYHLD DKTEKQIRKN FAKFGEYPKI
LIVTEKLLTG FDAPLLYAMY LDKPMRDHTL LQAIARVNRP YESEAQEMVK PHGFVLDFVG
IFNKLEKALA FDSDEINAII KDLKLLKTLF KNKMEQQAST YLSLIQNNFN DKDVDDIIEH
FREQERRKAF SKAYKELEML YEVISPDAFL RPYMDDYATL SSIYEVIRKA YSKRVYVDKA
LQRKTDELVQ KHIGTTAIAT VTDFVEINAQ TLEVIKDKQG GETTKVINLI KSIEKTAEEN
PDDPFLIAMA ERAKIVQEKF ENRQSDTQEA LDTLVKEIGE NEQRKKEQAK RGFDSLTFFV
FQTLQDAGID NPEAVSNQIK QAFIEYPNWR ISEAELRELR KEVTFAIFAE IDELDQVTAV
VDKLFMLLSQ AYSS