Gene Aazo_3236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3236 
Symbol 
ID9341040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3326158 
End bp3328155 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content39% 
IMG OID 
Productexcinuclease ABC subunit B 
Protein accessionYP_003722066 
Protein GI298491889 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAT TTAGTCTTCA AGCTCCTTTT AGTCCCACTG GTGATCAACC ACAGGCTATC 
GCCCAACTTA TAACCAGCAT CGAAGCGGGT AATCGTTACC AAACTTTACT GGGTGCGACG
GGAACAGGTA AGACGTTTTC TATCGCGGCA GTGATTGAAA AAGTTCGTAA ACCGACCCTA
GTCTTAGCCC ATAATAAAAC CCTTGCAGCA CAACTATGTA ACGAAATGCG GGAATTTTTC
CCCAATAACG CCGTTGAGTA TTTCGTCAGC TATTACGACT ATTACCAACC GGAAGCCTAT
ATTCCCGTCA CAGACACATA TATAGAAAAA ACAGCAGCGA TTAATGATGA AATAGATATG
TTGCGACATT CTGCAACTCG TTCTCTGTTT GAACGTCGCG ATGTAATTGT TGTTGCTTCC
ATTAGCTGCA TTTACGGTTT AGGAATGCCG GCTGAATACC TCAAAGCTGC AATTCCCCTA
CAGATAGGTA TGGAAGTTGA CCAAAGGCAG ATTTTGCGAG ATTTGACATC TGTGCAGTAT
AGCCGCAACG ATGTAGAAAT GGGTCGGGGA AAATTTCGCG TCCGGGGTGA TCTGTTAGAA
ATTGGACCCG CTTACGAAGA TAGAATCATT CGTGTTGAAT TTTTTGGTGA TGAAATTGAC
GCAATTCGTT ATATTGACCC TGTAACTGGG GAAATTCTCA GTAGTTTGCA AGCGGTGAAT
GTCTACCCTG CGCGTCACTT TATCACTCCA GAACAACGTT TAGAAGTGGC TTGTGAAGAT
ATTTCCGCAG AATTAAAACA GCAGAAATTA GCATTAGAAG AACTAGGTAA ATTAGTGGAA
GCACAACGCA TAGATCAACG CACACGCTAC GATTTAGAAA TGTTACGGGA AGTCAGATAT
TGTAACGGAG TAGAAAATTA TTCTCGTCAT TTAGCAGGTA GACAAGCAGG AGAACCACCA
GAATGTTTAC TTGATTATTT TCCTAAAGAT TGGTTATTAG TAATAGATGA ATCTCACGTT
ACTGTTCCGC AAATTCGCGG GATGTATAAC GGCGACCAAG CACGGAAAAA GGTTTTAATT
GATCATGGTT TTCGGCTTCC TAGTGCGGCG GATAATCGTC CCTTAAAAGC AGAGGAATTT
TGGCAAAAAG CTAATCAATG TATTTTTGTT TCTGCTACGC CGGGAAATTG GGAAGTAGAA
GTTTCTGAAG ATAATATAAT TGAGCAAGTA ATTAGACCTA CTGGAGTAGT AGATCCAGAA
GTTCTTGTTC GTCCTACGGA AGGACAAGTT GATAATTTAT TAGGAGAAAT TAAAGATAGA
GTTGAACTGA AAGAAAGAAC CTTAATCACC ACGTTAACTA AACGCATGGC GGAAGATTTA
ACGGAATATT TGGAACATAA GGGAGTTCAA GTTAGATATT TGCATTCGGA AATTAATTCG
ATTCAGAGAA TTGAGATTTT ACAAGATTTA CGAAATGGTA AATTTGATGT TTTGGTAGGT
GTGAACTTAC TACGGGAAGG TTTGGATTTA CCAGAAGTTT CTTTAGTGGC AATTATGGAT
GCAGATAAAG AAGGTTTCTT ACGTGCAGAA CGTTCGTTAA TTCAAACTAT TGGACGGGCT
GCGCGTCATA TTCAAGGTAA GGTGATTATG TATGCTGATA AGTTAACAGA TAGCATGATT
AAAGCTATTG ACGAAACTGA TAGAAGAAGG GGAATTCAAA CAGCATATAA CAAAATGTAC
GGAATTACAC CAAAACCAGT TGTGAAGAAG TCAAGTAATG CGATTTTATC ATTCTTGGAT
GTATCGCGGC GCTTAAATAC ACGTGATTTA AAGGTGGTGG ATAAACATTT AGATGAATTA
TGGTTAGAAG ATATTCCAGA GTTAATTACG CTGTTAGAAA AACAGATGAA GGAAGCAGCG
AAAAAGATGG AATTTGAAGA AGCAGCAAAA TTGCGCGATC GCATTAAACA TCTCAGGGGT
AAAATGTTAG GAAAATAA
 
Protein sequence
MTEFSLQAPF SPTGDQPQAI AQLITSIEAG NRYQTLLGAT GTGKTFSIAA VIEKVRKPTL 
VLAHNKTLAA QLCNEMREFF PNNAVEYFVS YYDYYQPEAY IPVTDTYIEK TAAINDEIDM
LRHSATRSLF ERRDVIVVAS ISCIYGLGMP AEYLKAAIPL QIGMEVDQRQ ILRDLTSVQY
SRNDVEMGRG KFRVRGDLLE IGPAYEDRII RVEFFGDEID AIRYIDPVTG EILSSLQAVN
VYPARHFITP EQRLEVACED ISAELKQQKL ALEELGKLVE AQRIDQRTRY DLEMLREVRY
CNGVENYSRH LAGRQAGEPP ECLLDYFPKD WLLVIDESHV TVPQIRGMYN GDQARKKVLI
DHGFRLPSAA DNRPLKAEEF WQKANQCIFV SATPGNWEVE VSEDNIIEQV IRPTGVVDPE
VLVRPTEGQV DNLLGEIKDR VELKERTLIT TLTKRMAEDL TEYLEHKGVQ VRYLHSEINS
IQRIEILQDL RNGKFDVLVG VNLLREGLDL PEVSLVAIMD ADKEGFLRAE RSLIQTIGRA
ARHIQGKVIM YADKLTDSMI KAIDETDRRR GIQTAYNKMY GITPKPVVKK SSNAILSFLD
VSRRLNTRDL KVVDKHLDEL WLEDIPELIT LLEKQMKEAA KKMEFEEAAK LRDRIKHLRG
KMLGK