Gene Aazo_2854 details


Gene Information

Locus tag: Aazo_2854
Symbol:
ID: 9340654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2936643
End bp: 2938268
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003721818
Protein GI: 298491641
COG category:
COG ID:
TIGRFAM ID:
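
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminating stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2936643
end_bp = 2938268
reported_gene_length = 1626     # bp
reported_protein_length = 541   # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1626 0 541
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.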


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA CAAAGCAGAA ATCTCTGCGT GGAAAAAAGC AATCTTCACC AGAGAAAACC 
CGCCTGAGCT TGAAAGAAGA GTTAGCCCAA AAGCGCAAAG CCACCATAGC ACGTAAAGAG
TTGACCAGCT TAGTTGGCAA ACTGGTAGGA AGCGGACTAT TTTTAGGAAT GCTGCTATTT
TTCGTCGGTG GAATTAAATT AGCAGTTCCT GGTGCATTAG GTATCATAGT CATTACCCTT
TGTTATAAAA ACCCGCTACC TGCTCTATTT GCCTTTGTTA TGTATGTACC ATTCGCCGGT
ACTATTATTT ACTTCTTGGG CAACAGTCCT GTACTTCAAC TAGCTAAAGA TGCTTTCTAT
GTTCCAGTAG TGATCGCTCT GTGGCAAAGT TGCAAAAAGC AAAAACAACC CTTCATTATT
CCTCAATCCA TCAAAACCCC ATTTTTGATT CTCCTTAGCT GTTCTATCCT CACCCTAGTG
ATGATAAATG GTGGACAGCA GTTAAATCCG GCTCGTGGCG ATATACCTAT AGGCATAGGA
ATTCTGGGAT TAAAAGTATT TCTAGGATAT TTTCCTGTAA TTACTTGCGT CTATTACCTA
ATTCTTAATC AGCAGGATTT TTGGTTGTTA TCCCGCCTTC AGATTCTCCT CATACTAGTC
TGCGGCATCT TGGGAGTTAT TCAATTTATC TTCCTCACAA TTGGAGTATG TAAAGGGACG
GTAGGCGTTG AAGGAGACGC TTTATTTAAG GCAACACTTG ATGCTCGGTG TTTAGTTGGT
GGTGCGCTCT TATACACACC AGAACAAGGA GTAATTCGCT TACCAGGAAC ATTTGTAGCC
CCTTGGCAGT GGGCATGGTT CTTAATTTCC AGCACCTTTT TTACATTTGC TACAACTTTT
AGCGACAAAT CTATTATTTG GCGGCTGATC AGTTTGGTTA CTTTAGGATT AGTCTTTTTT
AACGCAGTTA TCTCTGGACA AAGAATAGCC TTAGCTTTAG TACCAGTATG TTTCGCGCTT
TTGTTGTTGT TAACTGGTCC ATTGGTCAAC CTCAAAAAGG TTATCCCCTT GGGAGGAGCT
TTCGCTGTAA TTTTGGTAAT TGCAATGGCA GCTAATCCCA CTATCGTACA AGACAGAATG
AACAGTTTTA TCGGTCGATG GAATGCATCA CCACCTCATC ACTTTATAGT TGATCAATTG
CAAGAAAACT GGAAAAGTGT TGATACTCCT ATAGGTAGCG GCTTAGGTCG AGCTACGAAC
TCTGCCCGTG TATTTGGTTC AACCAAGTTG GTGGAAACCT ACTATCCTAA AGTGCTGTAT
GAAGTTGGAA TTGTCGGAGT CTTGGCTTTT TTGGTCTTTG TCACCAGTCT AACCGTTGCT
ACTTTTAAGA CATATCGCAC AATAAAAAAC CGTAACTTAC GAACCTATGG TGCTAGTATG
TGGGTGTTCG TACTATTTAT CAGTTACAAT ACTTACTACT ACCCCCTAGA TGTCGATCCA
GTTGCTGTTT ATTATTGGTT GTGTGCAGGA ATAATATTCA AATTACCAAT TCTGGATAAA
CAAGAAATGC CAGAAGAAAC AACCATCAAT ACCAAAGGTC AGAAAAAACG GCTGTATCAA
AAATAA
 
Protein sequence
MAKTKQKSLR GKKQSSPEKT RLSLKEELAQ KRKATIARKE LTSLVGKLVG SGLFLGMLLF 
FVGGIKLAVP GALGIIVITL CYKNPLPALF AFVMYVPFAG TIIYFLGNSP VLQLAKDAFY
VPVVIALWQS CKKQKQPFII PQSIKTPFLI LLSCSILTLV MINGGQQLNP ARGDIPIGIG
ILGLKVFLGY FPVITCVYYL ILNQQDFWLL SRLQILLILV CGILGVIQFI FLTIGVCKGT
VGVEGDALFK ATLDARCLVG GALLYTPEQG VIRLPGTFVA PWQWAWFLIS STFFTFATTF
SDKSIIWRLI SLVTLGLVFF NAVISGQRIA LALVPVCFAL LLLLTGPLVN LKKVIPLGGA
FAVILVIAMA ANPTIVQDRM NSFIGRWNAS PPHHFIVDQL QENWKSVDTP IGSGLGRATN
SARVFGSTKL VETYYPKVLY EVGIVGVLAF LVFVTSLTVA TFKTYRTIKN RNLRTYGASM
WVFVLFISYN TYYYPLDVDP VAVYYWLCAG IIFKLPILDK QEMPEETTIN TKGQKKRLYQ
K
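
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate codons. It assumes the amino acid assignments of translation table 11 match the standard code (the two tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The function names are hypothetical, and only the first three blocks and the last block of the gene sequence are used as input.

```python
# Codon table built from the compact NCBI-style listing (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGGCAAAAA CAAAGCAGAA ATCTCTGCGT"  # start of the gene sequence
last_block = "AAATAA"                               # end of the gene sequence

print(translate(first_blocks))  # MAKTKQKSLR
print(translate(last_block))    # K
```

The two translated fragments match the first ten residues and the final residue of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.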