Gene Aazo_1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1836 
Symbol 
ID9339629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1904695 
End bp1906263 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content37% 
IMG OID 
Productsecretion protein HlyD family protein 
Protein accessionYP_003721062 
Protein GI298490885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.161085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAC TTAATGCTAA CCAAATCAAT AGCAACGGCA ATGGAAAAAA TCAAAGTCTA 
CAGTTGCTGA CACCTCGCAA AAAAGCTAAA ATAGCATCCT TAACTAATGC CTATAAGCAG
GATGAATTTG AACAGTCTAT AGTCTTATGT CAATCACCAA TATGGTCACG TACCATCATG
ATTACCTTGA TGGTTGTAGC CTGTTTTGGA GTTGGGTGGG CTTATTTTTC CAAACTTGAA
CAAGTAGTTC CAGCAACAGG TCAATTAAAA CCAGAGGGAA CAGTCAAAGA AGTACAAGCT
CCCATTAATG GAGTTGTAAA ATCTGTTTAT GTAAAAGATG GTCAAACCGT AAAAAAAGGA
GACTTGATCT TAACATTTGA ATCGGTTGCA ACTTTAGCTG AGTTAAGTTC CTTAAATAAA
ATTCGCGTTG CTTTAACTAA AGAAAACGAT ATTTATCGTC GCTTGATGGG AGCAAGCACA
GGTATCACCT CAGAGTTGGA CTTTTTACGT AGTAACTTGT CACCAGAATC TGCTTTTCTC
CTTAAACATC GAGCATCATT AGTAACAGAA AATGAACTAC TGCGTTCTCA ATTAAAGAAT
ACTCCACCAG AAAATAGCAA CGGAATTGAT GAACAACAAC GCCTCATAGC AGCGAAGAGG
GAATTAGATT CCCGATCTAG CGCAGCTAAA TTAGAAGTTG AAAAAATCAG GAAGCAACTA
TCACAAACCA TCGTCAAAAT AAGAAATACT CAAGATAGTT TAGCCATTCA AACACAGATT
TTGGATAAAC TCAAAATATT AGCAGTCGAA GGTGGAATTT CTCAACTGCA ATATCTCAAT
CAGCAACAAC AAGTACAAAC TTTAAAAGCA GAAATATCAC AATTAACTGA GGAAGAAAAA
CGCCTCCAGC TTGATATTCA AAAAGGACAG CAGGAAGTAA CTAATACAGT AGCAGTTACT
GATAAAAACG TTCTGGAGCA GATAGCTAAC AACAAAAAGA GGATTGCCGA AATAGACAGC
CAATTTATGA AGATTATTCT GGATAATGAG CAGAAATTGG GAGATATTAA CAGTAAGATT
TCCCAGACGC AATTAAATGT TAGATATCAA GAGGTCCGTG CTCCTATAGA AGGGACAGTG
TTCGATATGC AAGCCAAAAA TCCTGGGTTT GTAGCAAACA CCACCCAAAA ATTATTGCAA
ATTGTACCTA ATGATAAATA TGTTGCTGAA GTATTTATCA CCAATAAAGA TATTGGATTT
GTAAGGGTAG GTATGAACGT AGATGTGAGA ATTGATTCCT TTCCTTTTAG CGAATTTGGA
GATATTAAAG GTCAGGTGAT TGATATCGGT TCAGATGCTT TACCCCCAGA TCAAATTCAT
CAATTTTATA GATTTCCAGC CAGAGTTAGC TTGCATAAAC AAAAACTAGA AACTCAAGGC
AAAAAGATAG CATTACAGTC TGGGATGTCA ATTACCGGTA ATATTAAAGT TCGCGAGGAA
CGTACTGTAC TTAGTTTGTT CACGCAGATG TTTACCAAGC AAGTGGAGAG CTTGAACGAA
GTGCGTTAA
 
Protein sequence
MTQLNANQIN SNGNGKNQSL QLLTPRKKAK IASLTNAYKQ DEFEQSIVLC QSPIWSRTIM 
ITLMVVACFG VGWAYFSKLE QVVPATGQLK PEGTVKEVQA PINGVVKSVY VKDGQTVKKG
DLILTFESVA TLAELSSLNK IRVALTKEND IYRRLMGAST GITSELDFLR SNLSPESAFL
LKHRASLVTE NELLRSQLKN TPPENSNGID EQQRLIAAKR ELDSRSSAAK LEVEKIRKQL
SQTIVKIRNT QDSLAIQTQI LDKLKILAVE GGISQLQYLN QQQQVQTLKA EISQLTEEEK
RLQLDIQKGQ QEVTNTVAVT DKNVLEQIAN NKKRIAEIDS QFMKIILDNE QKLGDINSKI
SQTQLNVRYQ EVRAPIEGTV FDMQAKNPGF VANTTQKLLQ IVPNDKYVAE VFITNKDIGF
VRVGMNVDVR IDSFPFSEFG DIKGQVIDIG SDALPPDQIH QFYRFPARVS LHKQKLETQG
KKIALQSGMS ITGNIKVREE RTVLSLFTQM FTKQVESLNE VR