Gene Aazo_1775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1775 
Symbol 
ID9339568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1842755 
End bp1843924 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content38% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003721023 
Protein GI298490846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATAT TGTCTATTGA TGAACTTAAA ATATTAGTCG CAAATCCCCA ACCCCCCTGT 
GTATCTCTGT ATATGCCTAC ACAAAAAGCA GGGGCAGAAA TCCGACAAAA TCCCATTCGC
TTCAAAAATT TAATACGGGA AGCTGAGGAA CAGTTAGATG CGCTAGGAAT ACGCCATACA
GAGGCGGTGA ATTTTCTCCA GTCGGCAATG GAATTGGATA GAGGTGATTT CTGGGAACAT
CAAGATGAGG GATTGGTGGT TTTTATTGCC GAAAATCTTT TTCGTTACTA TTGCTTACCA
AGAGACTGTT CGGAATTAGT AGTTGTGGGT CAGCAATTTC ACGTTAAACC CTTGCTACAT
TTAATTAATA ACGATGGTAA TTTCTACGTT CTGGCTTTGA GTCAAAAAAA TGTCAAATTT
TTTGCAGGGA CAGGCTATAG TTTCAACGAA GTTCAAGTAG AAAATATGCC ACAAAACTTA
GAAGCAACTC TTTTAGAAGA TGTGCTACAA AAAGGTGTAC AACATCGCAT TGCCAAATCT
AAAGGGGGAA CGTCTAATCC TTTTCAACAC CCAGGTTCAT TTCATGGACA AGGAAGCCCT
GATCAAGATA GGCATCTAGC AGATATCTTA CAATTTTGTT ACGCTGTTGA TACTGTATTG
CATGAAAAAT TAAGGGAGGA AAAAGCGCCT TTAGTATTAG CAGGAGTTGA GTATCTATTC
CCTATTTACC GACAAGCAAA TACTTATCCC CATTTATTAG CAGAAAGTAT TAATGGTAAT
CCAGAAACTA TCACCTCAGA ACAACTACAT GATGAGGCTT GGCGAATTGT TTCACCTTCA
TTTCAAGAAA ATAAAAGGGT AGGAATAGAA CTTTATGAAC GATTAGTTGG TGAAGATACT
GGAAGGGCTA CTAATAATAT CAAAGAAATT ATTCCAGCAG CTTACTATCA CCGAGTTGAT
ACTTTGTTTG TATCGGTATC TGAACAGAAA TGGGGGAAAT TTGATTCAGA AAATACGATT
GTGGAGTTAC ACACCGAACC AGAACCAAAT GATGAAGATA TGTTAGATTT TGCTGCTGTT
CACACGCTGT TAAATGGTGG CAGAGTTTAT ACTCTTGAAT CTCAATATAT GCCAAATGGG
TCAACAGTGG CAGCAATTTT TAGATATTGA
 
Protein sequence
MAILSIDELK ILVANPQPPC VSLYMPTQKA GAEIRQNPIR FKNLIREAEE QLDALGIRHT 
EAVNFLQSAM ELDRGDFWEH QDEGLVVFIA ENLFRYYCLP RDCSELVVVG QQFHVKPLLH
LINNDGNFYV LALSQKNVKF FAGTGYSFNE VQVENMPQNL EATLLEDVLQ KGVQHRIAKS
KGGTSNPFQH PGSFHGQGSP DQDRHLADIL QFCYAVDTVL HEKLREEKAP LVLAGVEYLF
PIYRQANTYP HLLAESINGN PETITSEQLH DEAWRIVSPS FQENKRVGIE LYERLVGEDT
GRATNNIKEI IPAAYYHRVD TLFVSVSEQK WGKFDSENTI VELHTEPEPN DEDMLDFAAV
HTLLNGGRVY TLESQYMPNG STVAAIFRY