Gene Aazo_4804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4804 
Symbol 
ID9342611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4910082 
End bp4912148 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content43% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003723096 
Protein GI298492919 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGA TTCTTTATGT AGAAGTTCCT ACTCCAGATA CTGGGGCTGT ACGCAACTGG 
CTACAAGTGG ATTTTGCACC AGGTAATGGA GAGAAATTGC TTACCCCAGA AGGTTTTCGC
TTGAGAAACC CTGGTGTGTC TGTGACGGAG AGTATACCAG ATGAACTGTC TATATTTGTC
TGGTCGGTAC AGCGTACTAC TTATCTCAAG GTATTCCGTT GGGCAGATAA ACCTTTTGCT
AATGAGAGGC AAATTCTTCA ACGTCTAACG ACAGGAATCC GCAGCCGCTT TCCCCATAGT
TATCCACAAC CGCCGGAAAT TGATGCTCAA AAGTCAATTT TCGCAGAGTT AGAACCTTAT
TATCCCCTAA CTGTCAAGTA TTTTCAGAAA ATGCCAAATG GGGAATATGA TCTGAAGCGT
GCTTACTGGT GGGAGCAACG TTGGCGTGAA GGGGTGAGAA ATCCTCAGCA GCCCCGTCAG
GTGTTATTTT CTGGTCAAGG GGAAGAGGGA AACGAAAGTC CCAATCTTCA ATATGACCTT
ATTTATATTG GTGGTGCTTT AGGTTCAATT CATGCGGCTT TGATGGCAAA GTTGGGCTAT
AAGGTGCTAT TGGTGGAACG TTTGCCGTTT GGCAGGATGA ACCGGGAATG GAATATTTCT
CGTGATGAAA TTCAAAGTTT GGTAAATCTG GGTTTGCTGA CAAGTGCTGA GTTAGAAAGC
ATTATTGCTA GGGAGTATAA AGACGGTTTC AATAAATTTT TTGATGCCAA TAATCCTTCT
GGCCTGAAAG CGCCGATTCT GCACACGCCG ACGGTGCTAA ATATCGCTTT GGATTCAGAT
AAGTTGCTGC AAGTATGTGG AGAAAAACTC AAAGGTGCTG GTGGTGATAT CTGGGATGAA
ACTGAGTTTA TTCGGGCTGA TATCAGCGAT GCACAGGTAA GTATTACTGT CAAGCATTTA
CCTAGCGGTA ATGAGCAGGA AGTGAGTGGA AGATTGCTGG TGGATGCAAT GGGAACTGCT
TCTCCTATTG CTTGGCAATT AAATGGTGGT AGGGCTTTTG ATAGTGTGTG TCCAACTGTG
GGAGCAGTAA TTGAGAACGG GTTTGAGCAT GGTGTGTGGG ATGTTCAGTA TGGAGATGTT
CTCTACAGCC ACGGGGATAT TTCTCGAAGT AGACAGTTGA TTTGGGAGTT GTTTCCTGGG
GTTGAGGAGG AACTAACAAT TTATCTGTTT CATTACCATG AGGTGAATGG GGAAAATCCT
GGTTCTTTGT TGGAGATGTA TGAGGACTTT TTCATGATTT TGCCGGAGTA TCGCCGCTGT
GATATGGATA AGCTGGTGTG GAAGAAGCCG ACTTTTGGCT ATATACCAGG GCATTTTAGT
GTAAGTAATA GAGATCGCAC CATTGCCTTT GATAGATTAA TTGCTATTGG TGATGCTGCT
TCCCTTCAGT CTCCCCTGAT TTTTACAGGT TTTGGTTCTC TCGTTCGCAA CCTAGAACGG
TTAACTACCC TATTAAATAC AGCCCTGAAA CACAACCTAT TGAGTTTTCA GTACTTAAAT
CAAATTCGTG CCTACCAAAG TAACGTTTCC GTGACTTGGT TATTTTCCAA AGGAATGATG
GTTCCCACAG GGAAATTTAT CCCGCCCCAA AGGGTAAACT CCATGCTCAA TACCTTCTTT
GGTTTATTAG CAGATGAACC CCCAGAAGTG GTAGATAATT TCATGAAAGA TCGTTGCGAT
TGGTTAACTT TTAACCGTTT AGCCTTGAAA GCAGCCAGGA AAAATCCTGC TTTACTGCTG
TGGATATGGC AAATGGCTGG GTTAAAAGAC TTACTGAGGT GGTTTGGTAA TTATTTCAAC
TTTGGTCGTC ACGCCCTTAT TAGCCTTTTG TTAAGTAGAT GGTTTTCAGA CTTCCTGAAA
TCAAGTCAAT CTTGGTTAGA ACCCCGTTAT CCCGGATTAT GGTTGCAATT ATTAACCATT
AATTATGGAA TTTCTACTGG TAAACCACAG CGGCATAATC AGGTAGTTAA AGTTAACTCA
AAAGCAGTAA CTCAAATTGG TCATTAG
 
Protein sequence
MKEILYVEVP TPDTGAVRNW LQVDFAPGNG EKLLTPEGFR LRNPGVSVTE SIPDELSIFV 
WSVQRTTYLK VFRWADKPFA NERQILQRLT TGIRSRFPHS YPQPPEIDAQ KSIFAELEPY
YPLTVKYFQK MPNGEYDLKR AYWWEQRWRE GVRNPQQPRQ VLFSGQGEEG NESPNLQYDL
IYIGGALGSI HAALMAKLGY KVLLVERLPF GRMNREWNIS RDEIQSLVNL GLLTSAELES
IIAREYKDGF NKFFDANNPS GLKAPILHTP TVLNIALDSD KLLQVCGEKL KGAGGDIWDE
TEFIRADISD AQVSITVKHL PSGNEQEVSG RLLVDAMGTA SPIAWQLNGG RAFDSVCPTV
GAVIENGFEH GVWDVQYGDV LYSHGDISRS RQLIWELFPG VEEELTIYLF HYHEVNGENP
GSLLEMYEDF FMILPEYRRC DMDKLVWKKP TFGYIPGHFS VSNRDRTIAF DRLIAIGDAA
SLQSPLIFTG FGSLVRNLER LTTLLNTALK HNLLSFQYLN QIRAYQSNVS VTWLFSKGMM
VPTGKFIPPQ RVNSMLNTFF GLLADEPPEV VDNFMKDRCD WLTFNRLALK AARKNPALLL
WIWQMAGLKD LLRWFGNYFN FGRHALISLL LSRWFSDFLK SSQSWLEPRY PGLWLQLLTI
NYGISTGKPQ RHNQVVKVNS KAVTQIGH