Gene Aazo_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0049 
Symbol 
ID9337832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp42970 
End bp44670 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content40% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003719826 
Protein GI298489649 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTAC CTATTGTGTC TGAGAAGAAA AAAAAGAAGC GGTCTTTAGT ACTGACATTA 
TCAGCTGCGG CGTTATTGAT TGGTGTAGGC AGTTTTGCTC ATTGGTTTTT TACCCAAGGA
CGACCGTTTT CTAGAAGTTT GCCAGTGGGT GCAAATATTA TTCCTCAAGA TGCCTTGTTT
GCAGTTTCTT TAACAACAGA TACTAAACAA TGGCAAAAGT TGAGGGAATT TGGGACACCA
GAAACTCAAA AGGAACTGGA TAAAAATTTG GTGCAACTGC GTGATCGCTG GTTAACCAAT
AATGGCTACA ATTTCGAGAA GGATATTCAA CCTTGGGTGG GGGATGAGGT CACCATTGCT
GTTTTGCCTC CCCCAGTAGT CAAGCCAGTG CTGAAACCAG TAGCTACTGA GGCTAATATT
CCTTATGAAC AGTCAATGGT GATGGTGCTG CCAATCAAAA ACCCAGCAAT TGCTAAAAAG
ATGTTGGCAC AATCTCAAAC CCTTAAACAA GGTAAATCGA CTGAGCTTAC TTATCGGGGA
ATCGCAATTA AGCAAACTGA AGGAAAAGCT GGAGAAAAGC TGTCAGCAAC ATTAATAAAT
CAGCAATTGC TTGTAATTAC AGATAATGCT AAAGCCACAG AAAAAACAAT TGATGCTTAT
AAAAATCAAA CATCTGTAGC AACATTAGTA GGCTTTGCAG AAAATTTCTC AAAAACCTCT
AGTTATCAAC CGTTTGCTCA ATTTTATATA AATGTACCCT TGGCTGCAAA AATAGCGGCG
ACAGCCCCTA ATCGACGTTT ACCTGTTCAG GTTCTTGCCC AACTTCAGAA TAACCAAGGT
TTAGCGGGAA CTCTGAACTT AGAATCTGAA GGAGTGCGTT TAAAGGGTAT TTCTTGGTTA
AGCCCTAATA GTCAAAGAGT TTTGGCGGTA GAAAATAAAG CTGGAAGTAT GCAAAATCGC
CTCCCCAGCG AAACCTTAAT GATGTTGTCT GGTAGTAACT TAAAGCGGTT GTGGGCAGAC
TATGTTTCTA CTTCTGGGGG AAATCCGTTG GCACCAATCA AACCCGAAGA ACTGCGACGG
GGTGTAAAAT CTTTAACAAA TTTGGATTTA GATCAAGATT TACTGAGTTG GATGAAGGGG
GAATTTTCAG TTTCGGTAAT TCCTAATACT TCACGAGATG GTTCACCGGA TAACTTTCGG
GCTGGTTTAG TATTTATGGT TAAGGTAAGT GATGGTAAAG CTGGGGTACG GCAATCGGCT
GAAACTGCTT TACAAAATCT TGATGATGTG CTGAAAAATC AATACCAGTT TAAAGTTGAA
TCAGCTACTG TTGGGGGTAA ACCCGTTGTT AACTGGATTT CACCTTTTGG GACTTTAACG
GCTACTCATG GTTGGTTAGA TGACAATGTG GTTTTTTTCG GCTTCGGCGC TCCCATCAGC
GATAAAATTG TTCCTAAACC CAACAATACT CTAGCCAACA CTCTACGTTT TCAACAAACC
GTTCCCAAGG AATTAAATCC AGCCAAGGGT CAATTTTTCT TGGATATGGA ACGGACTGTT
AAAAGTTTTC CTCTCAATCT TGAATCTCCT GGTCAACAAG CATTACTTTC TGCTATACAA
ACTATAGGTA TAACAACTGC TGTCAACGAT AATCGTAGTC AGGAATATGA CATTTTTGTG
GAACTGAAAA AAGGTAAATA G
 
Protein sequence
MTLPIVSEKK KKKRSLVLTL SAAALLIGVG SFAHWFFTQG RPFSRSLPVG ANIIPQDALF 
AVSLTTDTKQ WQKLREFGTP ETQKELDKNL VQLRDRWLTN NGYNFEKDIQ PWVGDEVTIA
VLPPPVVKPV LKPVATEANI PYEQSMVMVL PIKNPAIAKK MLAQSQTLKQ GKSTELTYRG
IAIKQTEGKA GEKLSATLIN QQLLVITDNA KATEKTIDAY KNQTSVATLV GFAENFSKTS
SYQPFAQFYI NVPLAAKIAA TAPNRRLPVQ VLAQLQNNQG LAGTLNLESE GVRLKGISWL
SPNSQRVLAV ENKAGSMQNR LPSETLMMLS GSNLKRLWAD YVSTSGGNPL APIKPEELRR
GVKSLTNLDL DQDLLSWMKG EFSVSVIPNT SRDGSPDNFR AGLVFMVKVS DGKAGVRQSA
ETALQNLDDV LKNQYQFKVE SATVGGKPVV NWISPFGTLT ATHGWLDDNV VFFGFGAPIS
DKIVPKPNNT LANTLRFQQT VPKELNPAKG QFFLDMERTV KSFPLNLESP GQQALLSAIQ
TIGITTAVND NRSQEYDIFV ELKKGK