Gene Aazo_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1061 
Symbol 
ID9338857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1134490 
End bp1135926 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content42% 
IMG OID 
Productpolysaccharide export protein 
Protein accessionYP_003720541 
Protein GI298490364 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0167243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAATA TACATTTGTG GAAAATTTTC AGTCATTCAA CTACAGCTGT AGTTTTATTA 
ACAACAGTTA ATATTGCTTT ATCATCTCTC AGCCTAGCCC AGACACAAAA ACTATCAGCA
GCGTCAACCA TATCCACAGA TTACTTATTA GGCGGTGGCG ATCGCATTCG CGTCAATGTC
TTTGAAGTAC CAGAATATAC AGGTGAGTAC CAAGTTCCCC CTGGTGGTGC GATTAATCTG
CCTTTAATTG GCAGTATATC TGTTCTGGGA TTAACAACAG AACAGGCAGC AGATGAAATA
GCCAGAAGAT ATGCTCGTTT TCTGAAACGT CCCCTGATTT CCGTGAATTT GTTATCATCT
CGTCCTATTA ATGTTTTTGT AGCCGGAGAA GTAACAAGAC CAGGATCTTA CACTCTCAGC
TTACAGGGAA GTGGAGGCGA TAATCCTGGT GTACAATACC CGACCGTATT AGCTGCGCTG
ACTACAGCCC AAGGGGTAAC ACTAGCAGCA GATGTAAGCA AAGTACAATT ACGCCGTAAA
ATAGGACGTT CTGGTGAACA AGTTATTAGT TTTAACTTAA AAGAAATCAC CAAAACAGGT
AAGATACCCC AAGATATTAC CTTACGGGAT GGAGACACAA TCGTTGTACC CACCGCCACG
AACTTCAACG TTGCTGAAGC CCGAAATTTA TTTGCTGCTA ACTATGCAGC TAGTCAAAAC
GCACCTCGCA CAGTTGCTAT TACAGGACAA GTTTACCGTC CTGGTTCTTA TCTGGTGACA
CCAGGTTCTT CTAGTTCGGA AGCAGGTACT GTAGCCCCTG GAAGTGGCTT ACCAACTTTA
ATGCGGGGAA TTCAACTAGC CGGAGGAATT ACATCACAAG CTGATGTAAG GAGTATTAAA
ATCCGTCGTC CTACAAGAAT TGGCTCAGAA CAAACTTTAA ATATTAATCT CTGGGAATTG
TTGCAAACTG GGGATCTCAA TCAAGATGTC GTGTTGCAAG ATGGAGATAC AATTGTAGTC
CCCACAGCAA CTGAGATTAA CACAGCAGAA GTGACCCAAT TAGCTACCAC TACTTTGTCA
CCTGCAACTA TTCAAGTTGG GGTAGTAGGA GAAGTGAAAA AACCTGGATT AACAGCTTTA
CAACCTAATA GCTCTTTAAA TCAGGCTTTG CTGGCTGCTG GAGGTTTCAA TGATGCTAGG
GCTAGTAGTG CTGCTGTAGA TTTGATTCGT CTCAACCCCA ATGGCACTGT TAGTAAACGG
GTAGTAAAAA TAGATTTCTC AAAGGGAATT AATGACGAAA CGAATCCTAT ACTTCACAAT
AATGATGTTG TCCTAGTTAG CCGTTCTGGT ATTGCTAAGA CTAGTGATAC AGTCAATACT
GTAGCTAGTC CTTTGGGTAC TCTTTTAGGC ATTGTTAGGA TATTTTTTGG ACTCTAG
 
Protein sequence
MLNIHLWKIF SHSTTAVVLL TTVNIALSSL SLAQTQKLSA ASTISTDYLL GGGDRIRVNV 
FEVPEYTGEY QVPPGGAINL PLIGSISVLG LTTEQAADEI ARRYARFLKR PLISVNLLSS
RPINVFVAGE VTRPGSYTLS LQGSGGDNPG VQYPTVLAAL TTAQGVTLAA DVSKVQLRRK
IGRSGEQVIS FNLKEITKTG KIPQDITLRD GDTIVVPTAT NFNVAEARNL FAANYAASQN
APRTVAITGQ VYRPGSYLVT PGSSSSEAGT VAPGSGLPTL MRGIQLAGGI TSQADVRSIK
IRRPTRIGSE QTLNINLWEL LQTGDLNQDV VLQDGDTIVV PTATEINTAE VTQLATTTLS
PATIQVGVVG EVKKPGLTAL QPNSSLNQAL LAAGGFNDAR ASSAAVDLIR LNPNGTVSKR
VVKIDFSKGI NDETNPILHN NDVVLVSRSG IAKTSDTVNT VASPLGTLLG IVRIFFGL