Gene Ava_C0122 details


Gene Information

Locus tag: Ava_C0122
Symbol:
ID: 3677830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007412
Strand:
Start bp: 149774
End bp: 151057
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 37%
IMG OID: 637715205
Product: HNH endonuclease
Protein accession: YP_320399
Protein GI: 75812782
COG category: [V] Defense mechanisms
COG ID: [COG1403] Restriction endonuclease
TIGRFAM ID:
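
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
start_bp = 149774
end_bp = 151057
reported_gene_length = 1284      # bp
reported_protein_length = 427    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under those assumptions the computed values agree with the reported gene length and protein length.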


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAG TATTTGTTTT AGATACCGAA AAAAGACCCT TAGACCCAAT TCATTCGGCG 
CAAGCTAGAC AGTTATTACG AAACAAAAAA GCAGCAGTAT TTCGGCGTTT TCCTTTCACA
ATTATTCTGA AAGAATCTAG AGCAGATGCA TCTGTATCTG ACTTGAGAAT TAAGCTAGAC
CCTGGAGCTA AAATAACGGG AATAGCATTA GTCAATGATT CCACAGGTGA AGTAGTTTTT
GCTGCTGATT TAAAGCATAG AGGCTTCGCT ATCAGAGATG CTTTGATTTC CAGAAGGCAA
CTAAGACGTA CTAGGAGGAA TCGCAAAACA CGATACAGAA AACCCAGATT TCTCAACAGA
ACTAGATCAG AAGGATGGTT AGCTCCTAGC CTTATGAGTC GGGTTCACAA TGTTGAAACA
TGGGTAAACA GATTACGCAA ATTCGCACCA ATCACAGCGA TTAGTACCGA ATTAGTCAAG
TTTGATATGC AATTAATGCG TAATCCTGAA ATCGAAGGTA AGGAATATCA ACAAGGTACG
TTAGCTGGAT ATGAAACCAG AGAATTTCTT CTAGAAAAAT GGAACAGACA ATGCGCCTAT
TGTGGTATCA AAGATATACC TTTACAGATT GAACACATTC ACTCACGCTC AAAGGGAGGT
TCTAATTCAA TCACTAACCT TACTTTGAGT TGCGAAAAAT GTAACGTCAA AAAAGGGACT
AAAGATATTA AAGACTTTCT CAAAAAAGAT CCCACTAGAT TACAGAAAAT TTTGGCACAA
GCCAAGAAAC CATTAGCTGA TGCAGCAGCA GTCAATGCCA CTAGATACAA ACTTCTAGAG
GTTTTAAAAT CAACTGACTT ACCCGTTGAG TGCGGTTCTG GTGGATTAAC GAAGTTCAAT
AGAACTAATC AACAATTACA AAAAACTCAT TGGTTAGACG CTGCTTGTGT TGGTCAATCA
ACTCCAATAT TAATTATCAA AGGTATCAAA CCGTTGTTAA TTACCGCTAA TGGGCACGGA
ACAAGGCAGA TGTGTAGAAC TGACAAATTT GGATTTCCTA ATAGATATGT TCCCAAACTG
AAATTTATCA AAGGTTTTCA GACAGGTGAT ATTGTTAAAG CTGTTGTCAC CAATGGAAAA
AAAATTGGTG AATACATTGG ACGTGTAGCT GTACGTTCTA CAGGTAGCTT TAATATCTCT
GCTCAGAGAG GATTGATTCA AGGAATAAAC TATAAGTTCT GTAAATCAAT TCACAAAAAA
GATGGTTACA GTTATTCAAG TTAA
 
Protein sequence
MSKVFVLDTE KRPLDPIHSA QARQLLRNKK AAVFRRFPFT IILKESRADA SVSDLRIKLD 
PGAKITGIAL VNDSTGEVVF AADLKHRGFA IRDALISRRQ LRRTRRNRKT RYRKPRFLNR
TRSEGWLAPS LMSRVHNVET WVNRLRKFAP ITAISTELVK FDMQLMRNPE IEGKEYQQGT
LAGYETREFL LEKWNRQCAY CGIKDIPLQI EHIHSRSKGG SNSITNLTLS CEKCNVKKGT
KDIKDFLKKD PTRLQKILAQ AKKPLADAAA VNATRYKLLE VLKSTDLPVE CGSGGLTKFN
RTNQQLQKTH WLDAACVGQS TPILIIKGIK PLLITANGHG TRQMCRTDKF GFPNRYVPKL
KFIKGFQTGD IVKAVVTNGK KIGEYIGRVA VRSTGSFNIS AQRGLIQGIN YKFCKSIHKK
DGYSYSS
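
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. This is a rough sketch rather than the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the table uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits). Only the first 30 bases and the final row of the gene sequence are used.

```python
from itertools import product

# Standard amino acid assignments in TCAG order, shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final row of the gene sequence above.
first_bases = "ATGTCCAAAG TATTTGTTTT AGATACCGAA"
last_row = "GATGGTTACA GTTATTCAAG TTAA"

print(translate(first_bases))  # MSKVFVLDTE
print(translate(last_row))     # DGYSYSS
```

Both outputs match the corresponding ends of the protein sequence. The final row is in frame because the 1260 bases that precede it are a multiple of 3.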