Gene Aazo_1951 details


Gene Information

Locus tag: Aazo_1951
Symbol:
ID: 9339744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2030564
End bp: 2033530
Gene Length: 2967 bp
Protein Length: 988 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003721160
Protein GI: 298490983
COG category:
COG ID:
TIGRFAM ID:
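
The length fields in this record agree with its coordinates if the start and end positions are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption, as is the idea that the final codon is a stop codon excluded from the protein length. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2030564
end_bp = 2033530

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2967
print(protein_length)  # 988
```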


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTGGA AATGGTGCTT TAGAGTTATT ATCGCTTTGT TGGGGCTGTG GCTATTCTGG 
GATATAGTTT CCCATCTAGG GGCAGAGATT TTTTGGTTTC AGGAAGTTGG CTATTTGCAA
ACATTTCTGC TGCGCTTGGT GACTAAGGGT GCTTTATGGG TGGGTGTGGT CAGTGTTAGT
TTTACCTATC TGTTTTTTAA TCTGGCTTTA GCACAACGCT GGAAATATCC CCAGTCTCTG
AAAACTGAGT TGGTCAGAAA TGAGGAGACA AGACTGAGTA AGCAACTCAC AAAATTTCTC
AGTCCACAGT ATGGCACAGT CTATAAACCT CCTTATGTTG AGACTGGATA CAAAGAAATT
AGATTGCGCT GGCTGTTACC ACTGACGTTA GGCTTGAGTT TGTTGGTGGG GTTGATGGTA
ACCCATTATG GACAAGTCGC TCTAAGTTAC TGGAATAGAG AAGTTAATCA GGTTATTTCA
CCTTTTACAA TTCTATTTAG ACCAGAGATT CTTTGGAATT TCGGACACAC AATTATTTTT
CAGGACTGGT ATTTTGGTGT GGCTTTAGTA ATGGCGATCG CACTCTTAAT ATATTCTCAC
TTTTTGCTGA GAGCGATCGC ACTTATTTTT AGTCTTGGTT TTGGCTGGAT ACTATCTCAA
CATTGGTGTA AAATACTACT TTATTTCCAC TCTACTCCCT TCAATACCAA CGACCCTTTA
TTTGGCAAAG ATATCGGTTT TTATATTTTT TCACTCCCTC TGTGGGAACT TTTGGCATTT
TGGCTACTTG GATTATGTTT ATATGGCTTT GTGTCCGTTG GTCTCACCTA TCTTTTATCA
GGAGATAGTC TAAGTCAAGG CATTTTTCCT GGTTTTTCTC TACCACAACA ATGTCATTTA
TTCGGTTTGG GTAGCTGTTT GATGTTGGTA GTAGCCTTTA GTTATTGGTT AGGTCGTTAT
GAACTGGTTT ATTCTCCCCG TGGGGTGAGT TTTGGCGCTA GTTACACCGA TGCTAAGATC
CAGTTGCCAG TTGACACTAT GTTATGTGTT TTAGCTGTAG TGCTCGCATT TTATTTATTA
TGGCAAACTT TCTTCTGGAA ACCCAAATCG CAACATCATC GTTGGGCAAT TTATAGTTTA
TGTATTTATA TAATTTTAAT AATTACAGGT GATTTTCTTG TACCTGCTGT TGTCCAATCT
TTAATAGTTC AACCCAATGA ATTGCAGCGA GAGAAGCCTT ACATCCAACG TACTATTAAC
TTTACTCGTC AAGCATTTGA TTTAGAAGCA GTTGATTCTC AAACCTTTAA TCCCCAGGGA
AACTTAACCC AAGCTGATCT CAAAGTCAAT GATCTAACAA TTCGTAATAT TCGTCTTTGG
GATCAAGAAC CATTATTAAA AACTAACCGC CAATTACAAC AAATTCGTCC CTATTATCAA
TTTCCCGATG CTGATATTGA TCGTTACACA ATAAAAATTG AACCCAACCA AGAAGCAACC
GCATCAACCG AAAAACGTCA GGTACTAATT GCAGCCAGGG AATTAGACTA TAATTCTGTT
CCACAGCAAG CTAAAACATG GGTAAACCGC AATTTAGTTT ACACCCACGG TTATGGCTTT
ACCATGAGTC CTGTGAATAC TGTTGCTCCT GGTGGTTTAC CAGAATATTT TGTTAAAGAC
ATTAGCGGTA ATGATAGTGC GCTAACTACT TCTAGTGAAG CTATTCGTGA AAGTATTCCT
ATTGGTAAAC CGCGAATTTA TTATGGAGAA ATTAGCAATA CTTATGTCAT GACTGGGACG
AAAGTTAGAG AATTAGATTA TGCCAGTGGC AGAGATAATG TTTACACCAG TTATGATGGC
GTGGGTGGAA TTAGAATTGG TTCTCTGGGT CGAAGATGGC TATTTGCTAC ATATTTGAAA
GACTGGCAAA TGATATTTAC ACGGAATTTT CTACCAGAGA CAAAAGTATT ATTCCGGCGA
AATATTAACC AACGAGTTCG TGCGCTCGCA CCTTTTCTAA AATTTGACAG TGAGCCTTAC
TTAGTAGCTG CTGATGCCAA TCCTGACAAC AAAAATGAAC AATTCCCGGC AACAAAGAAT
AATCTTTACT GGATTATAGA TGCTTATACA ACAAGTGACC GCTATCCCTA CTCTGACCAA
AATGGTGATG GCATCAACTA TATTCGTAAC TCCGTTAAGG TCGTAATTGA CGCTTACAAC
GGCACAGTCA GATTTTACGT TGCTGAACCA AAAGAGCCTC TAATTATTGC CTGGTCGAAA
ATATTCCCAC AAATGTTCCA ACCGCTGGCA AATATGCCTG TTAATCTCCA GAGTCATATC
CGCTATCCAG TGGACTTTTT TAAAATTCAA TCTGAACGGT TAATGGTTTA TCATATCACT
AACCCCCAAG TATTTTACAA CCGCGAAGAT CAATGGCAAA TTCCCAATGA AATATATGGA
ACGCAAACCC GTCAGGTTGA ACCTTACTAT TTAATTACTA GTCTTCCTAA CGTTCCCTTT
GAAGAATTTA TTTTACTGCT TCCCTACACA CCCAAACAAC GAACAAATTT AATTGCTTGG
TTAGCAGCGC GTTCAGATGG AGAAAATTAC GGTAAATTGC TGCTGTATAA CTTTCCTAAA
GAACGGTTGA TTTATGGGAC AGAACAAATA GAAGCGAGAA TTAACCAAGA TCCAGTAATT
TCTCAACAAA TTTCTTTATG GAATCGTCAA GGTTCGAGAG CAATTCAAGG TAATCTTTTG
ATTATTCCCA TTGAACAATC TTTGTTATAT GTTGAGCCAA TTTATTTAGA AGCAACTCAG
AATAGTTTAC CAACTTTGGT TAGGGTGGTT GTTGCTTATG AAAATCGCAT TATCATGGCA
AAAACTTTAG AACAAGCATT ACAAGGAATT TTTCAACCAG AGGTAACACA AGCACCGACA
ATTATTCGTC CTGTGGAACA AGAATGA
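
The gene sequence is printed in space-separated blocks of 10 bases, 60 bases per line, so the whitespace has to be stripped before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction. The function names are hypothetical, and only the first line of the sequence is used as sample input; the full 2967 bp sequence would be needed to compare against the GC content reported in this record.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence, copied as displayed.
first_line = (
    "ATGTCTTGGA AATGGTGCTT TAGAGTTATT "
    "ATCGCTTTGT TGGGGCTGTG GCTATTCTGG"
)

sample = clean_sequence(first_line)
print(len(sample))                    # 60
print(round(gc_fraction(sample), 3))  # GC fraction of this 60-base sample only
```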
 
Protein sequence
MSWKWCFRVI IALLGLWLFW DIVSHLGAEI FWFQEVGYLQ TFLLRLVTKG ALWVGVVSVS 
FTYLFFNLAL AQRWKYPQSL KTELVRNEET RLSKQLTKFL SPQYGTVYKP PYVETGYKEI
RLRWLLPLTL GLSLLVGLMV THYGQVALSY WNREVNQVIS PFTILFRPEI LWNFGHTIIF
QDWYFGVALV MAIALLIYSH FLLRAIALIF SLGFGWILSQ HWCKILLYFH STPFNTNDPL
FGKDIGFYIF SLPLWELLAF WLLGLCLYGF VSVGLTYLLS GDSLSQGIFP GFSLPQQCHL
FGLGSCLMLV VAFSYWLGRY ELVYSPRGVS FGASYTDAKI QLPVDTMLCV LAVVLAFYLL
WQTFFWKPKS QHHRWAIYSL CIYIILIITG DFLVPAVVQS LIVQPNELQR EKPYIQRTIN
FTRQAFDLEA VDSQTFNPQG NLTQADLKVN DLTIRNIRLW DQEPLLKTNR QLQQIRPYYQ
FPDADIDRYT IKIEPNQEAT ASTEKRQVLI AARELDYNSV PQQAKTWVNR NLVYTHGYGF
TMSPVNTVAP GGLPEYFVKD ISGNDSALTT SSEAIRESIP IGKPRIYYGE ISNTYVMTGT
KVRELDYASG RDNVYTSYDG VGGIRIGSLG RRWLFATYLK DWQMIFTRNF LPETKVLFRR
NINQRVRALA PFLKFDSEPY LVAADANPDN KNEQFPATKN NLYWIIDAYT TSDRYPYSDQ
NGDGINYIRN SVKVVIDAYN GTVRFYVAEP KEPLIIAWSK IFPQMFQPLA NMPVNLQSHI
RYPVDFFKIQ SERLMVYHIT NPQVFYNRED QWQIPNEIYG TQTRQVEPYY LITSLPNVPF
EEFILLLPYT PKQRTNLIAW LAARSDGENY GKLLLYNFPK ERLIYGTEQI EARINQDPVI
SQQISLWNRQ GSRAIQGNLL IIPIEQSLLY VEPIYLEATQ NSLPTLVRVV VAYENRIIMA
KTLEQALQGI FQPEVTQAPT IIRPVEQE
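
The protein sequence can be cross-checked against the gene sequence by translating codons in frame from the first base. The sketch below builds a codon table from the conventional T, C, A, G ordering. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as start codons), so a single lookup string serves here. The function name and the choice of sample input are illustrative: only the first three blocks and the final line of the gene sequence are translated.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 27 bases of the gene sequence, copied as displayed.
first_blocks = "ATGTCTTGGA AATGGTGCTT TAGAGTTATT"
last_blocks = "ATTATTCGTC CTGTGGAACA AGAATGA"

print(translate(first_blocks))  # MSWKWCFRVI
print(translate(last_blocks))   # IIRPVEQE
```

The two outputs match the first 10 and last 8 residues of the protein sequence listed above. The final line of the gene sequence starts in frame because the 49 preceding lines hold 2940 bases, a multiple of three.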