Gene Aazo_1999 details

Gene Information

Locus tag: Aazo_1999
Symbol:
ID: 9339792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2077922
End bp: 2080399
Gene Length: 2478 bp
Protein Length: 825 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: MutS2 family protein
Protein accession: YP_003721191
Protein GI: 298491014
COG category:
COG ID:
TIGRFAM ID:
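
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(2077922, 2080399)
print(length)                  # 2478
print(protein_length(length))  # 825
```

Both results match the Gene Length (2478 bp) and Protein Length (825 aa) fields listed in the record.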


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATCCAGT CTGAAACCTT AGAACTACTA GAATGGCATC GCCTCTGCCA GCACCTTTCC 
ACCTTTGCGG CAACTAAGTT AGGGGCGATA GTTGCCCGTG CCTTGCCGAT ACCGACAACT
CTAGAGGAAA GTAAAGAGTT GTTAGCACAA ACCAAAGAAG TCTATCAACT GGAAAGCCTG
CTGACAAAAG GGTTATCCTT TGAGGGCATT CAAGATATTG GTGATTCCCT AGAACGTGCA
GAACTACAAG GGATTTTGTC TGGTGAGGAA CTTTTGGCTA TCGCTACCAC CCTCGCTGGT
GCAAGAAATT TACGTCGGTT AATTGACAAT CAGGAAGATA TACCAATTTT CGCTGAGTTA
GTTGCCGAAT TACGAACTTA CCCGGAATTA GAGCAGGAAA TTCACCGTTG TATTGATGAA
CGAGCACAAG TTACAGACCG CGCTAGTCAA AAATTAGGAG AGATTCGGGA ATATTTGCGG
AAATCACGTG GTCAAATTAC CCAAAAACTG CACAATATCA TTCAAGCAAA ATCGGGAGCA
TTACAAGAAC CTATTATCAC TCAACGGGGT AGTCGCTATG TTATCCCCGT AAAAGCACCA
CAAAAAGATG CAGTTCCTGG TATTGTTCAC GATACATCCA CTAGTGGTGC AACTTTGTAT
ATAGAACCTA ATAGCATTGT GTCAATGGGC AACCAACTGC GCCAAGCTAT TAGAAGAGAA
CAGGCCGAAG AAGAAGCCAT TCGTCGCAGT TTAACAGAGC AAGTCGCCGC AGTTAAACCC
GATTTAGAAA AGTTATTAGC AATTGTCACT ACTTTAGATA TAGCTACAGC TAAAGCTAGG
TATAGTTTCT GGATAGGTGC TAATCCACCC CGGTTTGTAA ATCGTCAAGA ACAGCAAATA
ATTACCCTGC GGCAATTACG CCATCCCCTG TTAGTTTGGC AACAACACCA TGAACAGGGA
CATCCAGTAA TTCCTGTTGA TTTATTAATT AGTCCCCACA TAAAAGTCGT CACAATTACT
GGACCCAATA CTGGTGGTAA AACTGTAACG TTAAGAACCT TGGGATTAGC AGCTTTAATG
GCCAAAGTGG GCTTATTTGT TCCCGCCCGT GAACCAGTCG AGATTCCTTG GTTTGACCAG
GTTTTAGCCG ATATTGGGGA TGAACAATCT TTACAACAAA GTTTATCAAC ATTTTCCGGG
CATATTCGCC GTATTAGTCG GATTCTAAAT GCTTTAGGCA CTGGGGATCG GGGAGTCGGG
ACTAGGGACT GGGAGATGGG GGAGATGAGA GGAGATGGGG GAGATGGGGA AGGAATTTTC
CTAATGCCCA ATGCCCAATG CACCATGCCC AACTCCCTAG TCTTACTTGA TGAAGTGGGT
GCGGGTACTG ATCCGGTAGA AGGTAGTGCA TTAGCGATCG CTCTTTTACA ATATCTCGCT
GATCATGCCC AATTAACCAT TGCTACCACG CACTTTGGGG AATTGAAAGC TTTGAAATAT
GAGGATATTC GCTTTGAAAA CGCCTCAGTA GAATTTAATG ACGCTACCCT TTCACCAACC
TATCGGTTAT TGTGGGGTAT TCCTGGACGT TCCAACGCCT TAGCCATTGC CTTGCGTCTG
GGATTAAAAC CAGAAGTGGT AGAAGCTGCA AAAAGTCAAG TAGGAGAAGC CACGGACGAA
GTTAATCAAG TAATTGCAGG GTTAGAAGCA CAGCGGCGCA GTCAGGAAAC CAAAGCGGCG
GAAGCGCAAG AATTATTACG TCAAGCCGAA AAATTATACA AAGAAGTTTC CCAAAAAGCC
ACAGCTTTAC AAGAACGAGA AAAAGATTTG CGTGCTTCCC AAGAAGTAGC AGTACAGCAA
GCAATCATCC AAGCTAAAGG AGAAATTGCC GAAGTTATTC GCCGTTTGCA ACAAGGAAAA
CCCACCGCCC AACATGCCCA GGAAGCAACT AGTAAGTTAA GTGAAATTGC GGAGAGATAT
CAACCCACTC CACCACCCAA ACCCAAACCA GGGTTTATGC CCAAAGTAGG AGATCGCATC
CGTATTCGTA AACTAGGGCA AACCGCGGAA GTGTTAACAG CCCCTAATAC AGATGGAGAA
TTCAGTGTGC GCTTTGGCAT CATGAAAATG ATGGTGCAAT TACAAGACAT AGAGTCCTTA
GAAGGACAGA AACCAGAACC CATTGCTAAA CCGAAACCAG CACCAGCCGT TACCACACCA
CCAGCACCAG CTTTAGCTAT TCGTACCTCG AGAAATACCG TTGATTTGCG CGGAAAAAGG
GTAGTTGATG CCGAATATAT CTTAGAAAAA GCGATTTCCG AGGCTGATGG ACCTTTATGG
ATTATTCATG GATATGGTAC AGGTAAGTTA AAGCAAGGAG TTCACGCTTT TTTGCATCAA
CATCCCAGAG TCAGCCACCA CGAACCCGCA GAACAAGCAG ATGGCGGGAC TGGTGTCACA
ATTGCTCATG TGGAATAG
 
Protein sequence
MIQSETLELL EWHRLCQHLS TFAATKLGAI VARALPIPTT LEESKELLAQ TKEVYQLESL 
LTKGLSFEGI QDIGDSLERA ELQGILSGEE LLAIATTLAG ARNLRRLIDN QEDIPIFAEL
VAELRTYPEL EQEIHRCIDE RAQVTDRASQ KLGEIREYLR KSRGQITQKL HNIIQAKSGA
LQEPIITQRG SRYVIPVKAP QKDAVPGIVH DTSTSGATLY IEPNSIVSMG NQLRQAIRRE
QAEEEAIRRS LTEQVAAVKP DLEKLLAIVT TLDIATAKAR YSFWIGANPP RFVNRQEQQI
ITLRQLRHPL LVWQQHHEQG HPVIPVDLLI SPHIKVVTIT GPNTGGKTVT LRTLGLAALM
AKVGLFVPAR EPVEIPWFDQ VLADIGDEQS LQQSLSTFSG HIRRISRILN ALGTGDRGVG
TRDWEMGEMR GDGGDGEGIF LMPNAQCTMP NSLVLLDEVG AGTDPVEGSA LAIALLQYLA
DHAQLTIATT HFGELKALKY EDIRFENASV EFNDATLSPT YRLLWGIPGR SNALAIALRL
GLKPEVVEAA KSQVGEATDE VNQVIAGLEA QRRSQETKAA EAQELLRQAE KLYKEVSQKA
TALQEREKDL RASQEVAVQQ AIIQAKGEIA EVIRRLQQGK PTAQHAQEAT SKLSEIAERY
QPTPPPKPKP GFMPKVGDRI RIRKLGQTAE VLTAPNTDGE FSVRFGIMKM MVQLQDIESL
EGQKPEPIAK PKPAPAVTTP PAPALAIRTS RNTVDLRGKR VVDAEYILEK AISEADGPLW
IIHGYGTGKL KQGVHAFLHQ HPRVSHHEPA EQADGGTGVT IAHVE
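
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction calculation and a codon split. The helper names `clean_sequence`, `gc_fraction`, and `split_codons` are hypothetical, and the example input is only the first line of the gene sequence, so the printed GC value is not expected to equal the record's whole-gene GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def split_codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First line of the gene sequence in this record.
first_line = "TTGATCCAGT CTGAAACCTT AGAACTACTA GAATGGCATC GCCTCTGCCA GCACCTTTCC"
seq = clean_sequence(first_line)
print(len(seq))              # number of bases on that line
print(gc_fraction(seq))      # GC fraction of that line only
print(split_codons(seq)[0])  # first codon of the gene
```

The first codon is TTG while the protein sequence begins with M. This is consistent with translation table 11, which permits TTG as an alternative start codon that is read as methionine at initiation.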