Gene Aazo_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0549 
Symbol 
ID9338335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp573840 
End bp575345 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content41% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003720175 
Protein GI298489998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.456531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCTA GGCAAAAGAC GGCTCTAGAG AAACTCCTTT CTCAGAGTTT CCCTATTTTC 
AAATGGGTAA TGACATTACT GCTAATTTTG AGTTTAACCA GTTGTACTGA AAAAGCTGGA
AGCCAAGAAG TATCTATTGA CAATCAAAAA ACAGCGTCAC AAATCTCCCA AGTATGTCGG
CAATTTTCAG AAACTGCACC ACCAGAAGTA ATTCAAGAAC TGCGCCCAAT TTTGGAACTT
TATCAGCCAC TAGTAACAAT TATGACTCCT ACGGCAGATG AAGTTATTGA AGACAATACT
ATTACAGTCC GTTTCCAGGT TACAGACTTA CCAATATTTA AAGATCCCCA ATGGCAATTA
GGACCTCATG TGCACGTAAT TATTGACAAT GAACCTTACA TAGCGGTTTA TGATCTAAAT
CAACCTTTAG TTTTATCAGA CTTGTCCGTA GGTACACATA CATTGCGTGT TTTTGCGTCG
CGTCCTTGGC ATGAAAGTTT TAAAAATGAA GGTGCTTATA CCCAAATAAG ATTTCATATT
TTTACTAAAA CTGATGATAA CAATCCTGCC TCCAATTTAC CCCTCCTGAC TTATAGTAGT
CCTAATGCCA GTTACGGTGC AGAACCTATT ATGCTGGATT TTTACTTAAC TAATGCACCG
TTGCATATTG CTGCTGAAGA TAATCCTGAT GACACAATCA GCGATTGGCG TATTCGTTGT
ACAATTAATG GTGAAAGCTT CATTCTTGAT CGCTGGCAGT CAGTTTATCT CAAAGGTTTT
ACACCTGGTA AAAACTGGGT AAAACTGGAA TTCCTTGATA ACCAAGGCAA CCCTGTTAAA
AATGTCTTTA ACAGTACAGC TAGACTTATT AATTACGAAC CTAAAGGTAA AGACTCACTT
TCAAGAATTG TCAGAGGGGA ACTTACCGCT AATGAGGTGC GCGGTATTGT AGACCCGAAT
TATACAACTA AGATTCCGGT TACTAAACCT ACACCTACTG TAACGCCCAA GCTTGAAGTT
TCTCCCACTC CTCAACCTCA AGTGGGAAAA CCACCGACTC CAGAAATTGA AGTTTCCCCC
ACTCCTCAAG CTCAAGTGGA AAAACCACCG ACTCCAGAAG TTGAAGTTTC TCCCACTCCT
CAACCTCAAG TGGAAAAACC ACCGACTCCA GAAGTTGAAG TTTCTCCCAC TCCTCAACCT
CAAGTAGAAA AACCACCGAC TCCAGAAATT GAAGTTTCCC CCACTCCTCA AGCTCAAGTG
GAAAAACCAC CGACTCCAGA AATTGAAGTT TCCCCCACTC CTCAAGCTCA AGTGGAAAAA
CCACCGACTC CAGAAATTGA AGTTTCCCCC ACTCCTCAAG CTCAAGTGGA AAAACCACCG
ACTCCAGAAA TTGAAGTTTC TCCCACTCCT CAAATACAAC CAGCACCTGA ACCGCAGGAA
ACACCAAGCC CTATTGAGTC AATACAAAAA GATCAATCAA AGCTAGAAAA AACAGGATTA
AGGTGA
 
Protein sequence
MKSRQKTALE KLLSQSFPIF KWVMTLLLIL SLTSCTEKAG SQEVSIDNQK TASQISQVCR 
QFSETAPPEV IQELRPILEL YQPLVTIMTP TADEVIEDNT ITVRFQVTDL PIFKDPQWQL
GPHVHVIIDN EPYIAVYDLN QPLVLSDLSV GTHTLRVFAS RPWHESFKNE GAYTQIRFHI
FTKTDDNNPA SNLPLLTYSS PNASYGAEPI MLDFYLTNAP LHIAAEDNPD DTISDWRIRC
TINGESFILD RWQSVYLKGF TPGKNWVKLE FLDNQGNPVK NVFNSTARLI NYEPKGKDSL
SRIVRGELTA NEVRGIVDPN YTTKIPVTKP TPTVTPKLEV SPTPQPQVGK PPTPEIEVSP
TPQAQVEKPP TPEVEVSPTP QPQVEKPPTP EVEVSPTPQP QVEKPPTPEI EVSPTPQAQV
EKPPTPEIEV SPTPQAQVEK PPTPEIEVSP TPQAQVEKPP TPEIEVSPTP QIQPAPEPQE
TPSPIESIQK DQSKLEKTGL R