Gene Aazo_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4420 
Symbol 
ID9342222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4498688 
End bp4501360 
Gene Length2673 bp 
Protein Length890 aa 
Translation table11 
GC content45% 
IMG OID 
ProductDSH domain-containing protein 
Protein accessionYP_003722851 
Protein GI298492674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACTATC CCGCCCCGTC TTCAGAAATT AACTTAGGGT CGATTTTTCC CTTTGAACTG 
GATCAATTCC AACAGGAAGC GATCGCCTCC TTGAATGCTG GCCGCTCAGT AGTTGTCTGT
GCACCCACAG GTTCAGGTAA AACATTAATT GGGGAATACG CCATTTATCG AGCCCTAGCG
CGAGGGAAAC GAGTATTTTA TACCACACCC CTCAAAGCGT TATCGAATCA AAAATTACGT
GACTTTCGAG AGAAATTCGG GTTTGATCAG GTTGGACTGT TAACCGGGGA TGCCTCCATT
AACAGGGATG CCCCAATTTT AGTCATGACC ACCGAAATTT TCCGCAATAT GCTCTATGGC
ACACCCATAG GACAAATTGG CATCTCCTTG GTAGATGTGG AGGCTGTTGT CCTGGATGAA
TGCCACTACA TGAATGATCG CCAACGGGGT ACGGTGTGGG AAGAATCAAT CATCTATTGT
CCCCATGAAG TGCAACTTGT CGCCCTTTCC GCTACTGTTG CTAACAGTGA CCAACTCACC
GATTGGCTAA ATCGTGTTCA TGGTCCAACA GACCTGATTT ATTCCGATTT TCGCCCAGTT
CCCTTAGAAT TTAACTTTTG CAATCCCAAA GGACTCTTCC CCCTCCTCAA TGAAAGTAAA
ACCAAAATTA ACCCCAGGCT GATTAAACGG GGTAAAAAAG GACCAGGGGA AAAAGGCAAA
GGCGGTAGGC CAGAAGCACT CAGTATCATT TATACCATCA GCCAATTAGA GCAACGGGAT
ATGCTGCCAG CCATTTTCTT TATTTTCAGC CGCCGAGGAT GTGATAAAGC CGTGGCAGAA
GTAGGGGATT TATGGTTAGT CAATAATGAC GAATCCCAAA TATTACGCAG ACAGATTGAT
GAATTTTTAG CTCGTAACCC AGAAGCAGGA CGTTCTGGAC AAATTGCCCC TTTGTACCGA
GGAGTTGCAG CCCACCATGC CGGCATTTTA CCCGCCTGGA AAGTTTTGGT AGAGGAACTA
TTTCAACAGG GGCTAATTAA AGTCGTCTTT GCAACGGAAA CCTTGGCAGC GGGAATTAAT
ATGCCTGCAC GGACAACGGT AATTTCTACT CTTTCCAAAC GCACCGATAA TGGACACCGG
TTGTTGAAGG CTTCGGAATT CCTGCAAATG TCAGGTCGGG CAGGTCGTCG GGGCATGGAT
TTACAAGGCT ATGTGGTGAC ATTACAAACT CCCTTTGAGG GGGCAAAAGA AGCTGCATAT
TTGGCTACAT CTCCAGCAGA TCCCCTAGTT AGTCAGTTTA CACCCAGCTA TGGGATGGTG
CTGAATTTGC TGCAAACCCA TACTTTGGAA CAAGCTAGGG AACTGATAGA ACGCAGTTTT
GGACAGTATA TGGCAACGTT GTATTTAAAG CCAGAATACG ACGAAATGGG GGAAATAAAA
GCAGAATTAG CAAAAATTCA GGCAGAATTC GCAGCAATTG ATGAAAATGA ATTGGCGCTG
TATGAAAAAT TGCGACAACG GCTAAAAGTA GAACGTCATA TTCTAAAAAC TCTCCAAGAA
CAAGCACAGA CAGATAGACA AGAACAATTG TCGATGATGC TAGACTTTGC CGTGTCTGGG
ACGCTTTTGA GTCTCAAAGA TAAAAGTATG ATAGCAACTT TGCCAATTAC AGCAGTTTTG
GTTGAGAAAG CCCCTGATGT TGGTCAAGCT TCTTATTTTG TCTGTTTGGG TCAAGATAAC
CGTTGGTATG TGGCAACAGT TGCGGATGTG GTAGATTTGT ATGCTGAATT GCCCAGGGTG
GAAGTGTCGC ATGATATTTT ACCACCAGCA GAATTGGCAT TAAAAAGGGG TCAGTGTGTG
TGTGGTAATC AAGAAACTGC TGCGATTGCT CAGAGTATAC CCGACCCAGG GGAATTTATG
TATATGCCAC CAGAAGTTGT GGAACAACTT GCTCGGTTTA ATGCTGTGCA AGCACAATTA
GAAAATCACC CCTTACATCA ATCAGGAAAC ATAGCCAAAA TATTTAAAGA CAGAGCGCGT
TGTGTAGAAT TGGAAGCCGA ACTGGAAGAG TTACAAGAAC AGGTAGAACA ACAGTCTCAA
AGACATTGGG AACAATTTCT GAATTTAATT CAAATCTTGC AGCAGTTTGG CGGTTTAGAT
AACTTAGTAC CCACAACACT GGGACAAATG GCTGCTGCGA TTCGGGGTGA AAATGAATTA
TGGTTAGGTT TAGCGATCGC CAGTGGTGAA CTAGATAGTT TAGACCCCCA CCATTTAGCT
GCTGCGGCTG CGGCCTTAGT GACAGAAACC CCACGTCCAG ATAGTAAAGT ACATTTTGAC
CTCAGTAGTG AAGTAGCTGA TGCTTTGGCA AAGTTGCGGG GGATTCGTCG CCAATTGTTT
CAAATACAAC GACGCTATAA TGTGGCACTG CCTATTTGGT TAGAATTTGA ATTAATCGCC
ATTATCGAAC AGTGGGCTTT AGGTATGGAT TGGGTACAAC TATGTGCAAA TACCACTTTG
GATGAAGGTG ATGTAGTCAG GCTTTTACGC CGGACTCTAG ATCTATTATC CCAGATTCCT
CATGTTCCAT TAGTCCCGGA CTCTTTGCGG AAAAATGCTC AACGGGCTAT GCAGTTAATT
GATAGATTCC CTGTGAATGA GGCTATGGAG TAA
 
Protein sequence
MNYPAPSSEI NLGSIFPFEL DQFQQEAIAS LNAGRSVVVC APTGSGKTLI GEYAIYRALA 
RGKRVFYTTP LKALSNQKLR DFREKFGFDQ VGLLTGDASI NRDAPILVMT TEIFRNMLYG
TPIGQIGISL VDVEAVVLDE CHYMNDRQRG TVWEESIIYC PHEVQLVALS ATVANSDQLT
DWLNRVHGPT DLIYSDFRPV PLEFNFCNPK GLFPLLNESK TKINPRLIKR GKKGPGEKGK
GGRPEALSII YTISQLEQRD MLPAIFFIFS RRGCDKAVAE VGDLWLVNND ESQILRRQID
EFLARNPEAG RSGQIAPLYR GVAAHHAGIL PAWKVLVEEL FQQGLIKVVF ATETLAAGIN
MPARTTVIST LSKRTDNGHR LLKASEFLQM SGRAGRRGMD LQGYVVTLQT PFEGAKEAAY
LATSPADPLV SQFTPSYGMV LNLLQTHTLE QARELIERSF GQYMATLYLK PEYDEMGEIK
AELAKIQAEF AAIDENELAL YEKLRQRLKV ERHILKTLQE QAQTDRQEQL SMMLDFAVSG
TLLSLKDKSM IATLPITAVL VEKAPDVGQA SYFVCLGQDN RWYVATVADV VDLYAELPRV
EVSHDILPPA ELALKRGQCV CGNQETAAIA QSIPDPGEFM YMPPEVVEQL ARFNAVQAQL
ENHPLHQSGN IAKIFKDRAR CVELEAELEE LQEQVEQQSQ RHWEQFLNLI QILQQFGGLD
NLVPTTLGQM AAAIRGENEL WLGLAIASGE LDSLDPHHLA AAAAALVTET PRPDSKVHFD
LSSEVADALA KLRGIRRQLF QIQRRYNVAL PIWLEFELIA IIEQWALGMD WVQLCANTTL
DEGDVVRLLR RTLDLLSQIP HVPLVPDSLR KNAQRAMQLI DRFPVNEAME