Gene Aazo_4034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4034 
Symbol 
ID9341839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4093165 
End bp4095138 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content34% 
IMG OID 
ProductWD40 domain-containing protein 
Protein accessionYP_003722623 
Protein GI298492446 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTTG ATATTGTTGG TGCTGCCGTC GATTTTGTTA CTCTTACCAC TAATCGAATT 
ATTTCAGCAT TTACCAGCCA AACACTTATT AAAGAAAATC AAATTAATCT ACCTGCCACA
GTTCAAAAAG TCAGACAAGA TCAGCGCGTA ATTCAAGCAG AGATGGATTA CTATCAGCGT
AGAGAAGCCA GAAAAAAAGA ATTTATGCAA ATTCAGGAGA CCCCCGTTAA TACAGATACA
GAGGTAATAT TATTACCTGA TAGGCAAAAG AAATTACTTA AAATCAAGCG GGAAGAAATC
GAAGATAGGA GCAAATTAAG TGCATTGTAT TTAGATTTAA GTCGAGAGAC AACTGCTAAA
GAAATTGAAT TTAAACAAAA AGAACTTCAG AGAATTTTCG ATCAGCAAAA ATGGCCTGGT
ATTTTGAGTC GTGATGAAGC ACAGAGAATT TTCATAGATG AACACAAGAA ACCACGCTTA
TTGATGTTAG TACCACCTCC AGATATTAGC GAAGACTTTC CGATTTCATT CCGTGATAGC
CTCAAAAAAG AAATTCGCAA CCAATTAAAA CTATTTTTAG AAAAATATTA TCCTCTTCAT
GGTGATTACT GCCCAGTTGA ATTTTATGGT AAATATTTTG AGCGTTCAAT TTTTGATGCC
GAAGTTAAGC AATTAGAAAC AATCTTATCC GCTATACCAA CCGCAATTAT TTACACTGAC
ATTACAGACC ACGAAGTTTA TTTAAATGTC AAATTTTGGG GTTTGCAAGA ACCAGTATCT
TTATCTTTTG AGCCTTGGAA CTGGGAAGAA GTAAAAGGAC AACTTGAAGA AGTTGGCAGC
GATCAGACTA AGAGTTTGCG AGCAATTAGA CAAACTATTG TCATTCTGCA CAAGCTGTTA
GCAGCATTTT TAGCAGATTG GTATTACCTA AACATCAACC CAAATTATGA ACCTCAGCTA
TTTAAATTAA GTGCAGATTT TCCTTCAGTC TGGACTAAAG AAATGTTGCA GAAACTAAGA
ATTATCCTAC AAAGATATAG AGCAGCTTAT AACTATGAGT TAAAAACATT AGCAAATCTT
GAAATAAAAA AACCAAAGGT TTGGCATTGT GTTGATACTC TGTATGGTCA TTCAAATTAT
GTTTTTTCAA TTGCTGTCAA TCCTCACGGA GAAACATTTG TTAGTGGCAG TGCAGATAAA
AACATTAAAA TCTGGGATAT CCAAACAGGT GAACTTATTC ACACTTTAAC TGGACATTCC
AACTATGTTT GCTCAGTGGC TTTTTCTGCT GATGGACAAA AAATTGCTAG TAGTAGCTAT
GACAAAACAT TTAAGTTATG GAATTGCTTA AAAAGCAAAA CTTTCATTGA ACATTCAGAT
TGTGTAACTT CAGTTGCCTT TAATTATGAT GGTAATACCT TGGCTACAGC CAGCTTAGAT
AAAACCATTA AAATATGGGA TTTAAACACT GAAAGGTTAA TATATACTTT AACTGACCAT
GCAAATTATA TTAACTGTGT AATTTTTACA TTGGATGGAC AAAAATTAAT TAGCTGCGAC
TCTGATAAAA CTATCAAAAT ATGGAGTGTT AAACAAGGAC TAGAAATTGT TAGCATTACT
GGACATACAG ATGCAGTCAA TACTATAGCT ATTAGCCCTG ATGGCAAAAT TTTTGCAACT
GGTAGTCATG ACAAGACAAT TAAACTTTGG TATTTGGCAA CAGCAGAACT CCTGCACTCA
TTTAATGGAC ATATAAATTC TGTCACCAGC GTTGCTTTTA GTCCTGATGG TAAAACTTTA
GTAAGTGGCA GTAGTGACAA CACAATTAAG CTTTGGAATC TGGAGTCAAA AGAATTGATA
AATACTTTTT CAGAACATTC CTCAAGCATT AATAGTGTGG CTTTCAGTGT TGACGGGAAT
AAAATCATCA GTGGTAGTGC TGATAATACA ATCAAGATTT GGCAATTTGA TTAA
 
Protein sequence
MSFDIVGAAV DFVTLTTNRI ISAFTSQTLI KENQINLPAT VQKVRQDQRV IQAEMDYYQR 
REARKKEFMQ IQETPVNTDT EVILLPDRQK KLLKIKREEI EDRSKLSALY LDLSRETTAK
EIEFKQKELQ RIFDQQKWPG ILSRDEAQRI FIDEHKKPRL LMLVPPPDIS EDFPISFRDS
LKKEIRNQLK LFLEKYYPLH GDYCPVEFYG KYFERSIFDA EVKQLETILS AIPTAIIYTD
ITDHEVYLNV KFWGLQEPVS LSFEPWNWEE VKGQLEEVGS DQTKSLRAIR QTIVILHKLL
AAFLADWYYL NINPNYEPQL FKLSADFPSV WTKEMLQKLR IILQRYRAAY NYELKTLANL
EIKKPKVWHC VDTLYGHSNY VFSIAVNPHG ETFVSGSADK NIKIWDIQTG ELIHTLTGHS
NYVCSVAFSA DGQKIASSSY DKTFKLWNCL KSKTFIEHSD CVTSVAFNYD GNTLATASLD
KTIKIWDLNT ERLIYTLTDH ANYINCVIFT LDGQKLISCD SDKTIKIWSV KQGLEIVSIT
GHTDAVNTIA ISPDGKIFAT GSHDKTIKLW YLATAELLHS FNGHINSVTS VAFSPDGKTL
VSGSSDNTIK LWNLESKELI NTFSEHSSSI NSVAFSVDGN KIISGSADNT IKIWQFD