Gene Aazo_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4454 
Symbol 
ID9342256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4539060 
End bp4540943 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content33% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003722882 
Protein GI298492705 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAT CAGAAATTTT GGCGAAGCTT ACTCAACAAG GTGTTCAATT TTGGGTAGAA 
AATAACAAAA TTAATATCCG TTCTCCTAAA GGTGTAATAA CATCAACAAT TAAGGCAGAA
ATAGCTACAT ATAAAGGAGA TATTTTAGCA TTATTACAGG AGATGAACCT TGTTACAAAA
TCTGCTTCTG AACCATTAAG TCAGGGAATC AGCTTGCCAA CTATTGGGAG ATTAATAGGT
GGTTTTGCTG GTGAATCACC TGTAGGATAT CAGCCACCAA TTATTAACCC CAAATTAATG
GCTCAAAACC TCAATGTTAC ATTTAGACCT TTACCTGATG GCTATCATAA TCAAATTATT
ATGAAATTCC GGCAGGAATT AGCATTAAAA TTAAAAAGTT TTGGAGTTAA TGTTCTCTCT
TGGCAGGAAG CGACAACAGA TATTTTTTAT GATATTAGAA TTCCTATATT AAACTTGAAT
TGCTCATTTA AAATTAAGGG AGTTCGAGCA GAAATTGATG CAGTAATAGA TGTGGAAAGA
CCAAATTCAT GGCTGAGAAA GTTAGGAATA TTCATAGCTG AAACTTTTTA TAAGTTATCT
TATCCTTGGT TACTCAATCA GCAAAAAATG TCTGTGGTAC AAATTGCTAA ATTAAGTAGT
TGGGCTGAAG ATCATGCTGC TAAATATGTT GAAGATCCAA CCAATACGCA GGTGATCATT
CTTAGTGATA TAAATTATGA TTTTATCAAT CCCTTAACGA AATATCAGGA AAAAATTAGG
ATTGGTATTA ATACACTAAT TAAAACATTC TCAGAAATCG TAATTGGCAT ATCTCCTGAG
CAAATTTCTA TCCTGAATAT GAATCTTTCT GATTCTACTT TCTTTAAATC AGAAATGGAT
GCTTTTGTTT CAAACTCACT TATTCCTAAA GTTTTTGTCC CTATTACTCC TCTATTAATG
AGTAGATTTA AAATAGCACA ATATAATCCT TATATGTCTA AATATACCCC TAAATTGGTT
AAACTAGGTC AAGAATTAGC CTCAACTGGT CTACTTCCAC CTGGATTTAA GTTGGCTGAA
CTTATTAAAA GAAAATCCCA CAGAGATATT GTCAATGTTA TTGTTAATGG TAGAACTGGT
GTTTCTTACG GGTTTGTAGC TTATGCTGAA CCTCCCTACT ATGTAGGAAA ACCAGAAATC
TCTACTTATG AATGGGATAA ATTATTACCT GTTGCTGGAT TTAGTAGTAA TGAAATTCGG
AAAAATGATG AAAGTAGACG TTATATAAAA ATCATCATTA ATGGAGAATA TGTATTTAAG
CAAATTCCCG ATATTTGGCT AGTGAGTTCT CGTTCTGGTT CTAATAAAAC AGACTTAAAT
CTTGAAGAAG ATATTATTCG TATTGGTTTA AAAGATGATT TACATTTGCA GTTACCTGTA
GGAAGTTTGT CACGTAAATC TGATTTCAAA CCTTCTTATG ATATCTATGT GATGCTGGCT
ATTAGTCTAG CTGCTGCTTT ATATACGCCA GAATTAATCA AAAATGGTGC GCCAATTGTT
CATTTTCACG GTTATCCGGC ATTTGATTGG TTTAAAGAAA ATGAATATTG CGTCGGTGTT
AATAATCCTT CTGTACCCTG TGGAACTTAT GAATCAGGTG TGTTTAATTT TTTAGGTCTT
TCTAATTTAG CTAGTCAACA AACGAAAAAT ATTAAATTAG TGAGTTTGAT AGAACCAGAT
CATGGTACAA ATTTTATTGC TCATGATATG GATTATCTAG TTGATAGGTT AAAACATGGG
TGTGTAGCAG AACAAATTGA ACTAGGTGGA CAACATTTTG CTTCTTTGAA AGCAAATTTA
GGTGATGGTG GGATTCCCAT ATAG
 
Protein sequence
MNASEILAKL TQQGVQFWVE NNKINIRSPK GVITSTIKAE IATYKGDILA LLQEMNLVTK 
SASEPLSQGI SLPTIGRLIG GFAGESPVGY QPPIINPKLM AQNLNVTFRP LPDGYHNQII
MKFRQELALK LKSFGVNVLS WQEATTDIFY DIRIPILNLN CSFKIKGVRA EIDAVIDVER
PNSWLRKLGI FIAETFYKLS YPWLLNQQKM SVVQIAKLSS WAEDHAAKYV EDPTNTQVII
LSDINYDFIN PLTKYQEKIR IGINTLIKTF SEIVIGISPE QISILNMNLS DSTFFKSEMD
AFVSNSLIPK VFVPITPLLM SRFKIAQYNP YMSKYTPKLV KLGQELASTG LLPPGFKLAE
LIKRKSHRDI VNVIVNGRTG VSYGFVAYAE PPYYVGKPEI STYEWDKLLP VAGFSSNEIR
KNDESRRYIK IIINGEYVFK QIPDIWLVSS RSGSNKTDLN LEEDIIRIGL KDDLHLQLPV
GSLSRKSDFK PSYDIYVMLA ISLAAALYTP ELIKNGAPIV HFHGYPAFDW FKENEYCVGV
NNPSVPCGTY ESGVFNFLGL SNLASQQTKN IKLVSLIEPD HGTNFIAHDM DYLVDRLKHG
CVAEQIELGG QHFASLKANL GDGGIPI