Gene Aazo_5064 details


Gene Information

Locus tag: Aazo_5064
Symbol:
ID: 9342873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5186852
End bp: 5188300
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003723284
Protein GI: 298493107
COG category:
COG ID:
TIGRFAM ID:
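The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes
    one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 5186852, End bp 5188300
length = gene_length(5186852, 5188300)
print(length, protein_length(length))  # 1449 482
```

Under these assumptions the result agrees with the listed gene length of 1449 bp and protein length of 482 aa.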


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACTGG CTGAAACTCG AAATTACAAA GATTCTAGTA ATCCTTTACT ATCTCTCAAC 
TACGAAAGCG CCTTAGAATC TCTAGGTAAT GATTACTACG ATGAAGTTGC TGCAGAGGAA
TTTCCTCAAC ATATTCTGCG TTGGCGCAAT AATGCATTAT TACCCCGTTT AGGACTTGAT
CCACAACTAG TAAAAAATGA AGATTTTATT ACCGCTTTTG GTAAATTTCA ACAACGAAAA
CCCTTTTTAG CATTGCGTTA TCACGGTTAT CAATTTGGTG AATATAACCC ACATTTGGGT
GATGGTAGAG GGTTTCTCTA TGGACAAATA CGGGGAAGCG ATCACGAATT ATACGACTTT
AGTACAAAAG GTTCTGGGAG AACACCTTAC TCCCGTGCTG GTGACGGTAT GCTCACACTC
AAAGGTGGAG TCAGGGAAGT TCTTGCAGCC GAAGCACTTA ACCGTTTAGG GGTCAGAACC
TCCCGCTGTT TAAGCATGAT TGAAACCGGA CTAGGTTTAT GGCGTGGAGA TGAACCTTCC
CCTACTCGTT CATCAGTAAT GATACGCATG AGCAAATCTC ATATCCGGTT TGGAACTTTT
GAAAGACTGC ACTATTTAAA ACGTCCTGAT TTAACTAAGA AATTGTTAGA TCATGTGATT
GAGCAATATT ACCAGAACCT CATTAATGAA CATGATAAAT ATGCCCTCTT TTATGCAGAA
TTAGTGCAGC GAGTAGCAGA GTTAGTAGCA CAGTGGATGG CGGCAGGTTT TTGTCATGGA
GTCTTGAATA CTGATAATAT GTCAATTACA GGGGAGAGTT TTGACTATGG ACCTTATGCC
TTTATTCCTA CTTATGATTT ATACTTCACC GCTGCATACT TTGATTATTA TCGACGCTAT
TGTTACGGTC AGCAACCGAG TATTTGTCAT TTGAATTTAG AAATGCTGCA AGAACCGCTA
AAGGCAGTAA TTGATATAGC TGATTTACAA AATGGTCTAT CTAAATTTGC CGAATATTAT
CAGGTTGAAT ATCGAAATCT AATTTTGAAA AAATTAGGGC TTGGCAATTT GCATTTTGTA
GAAGCTGATG ATTTATTGGA ATTGACAATT ACCTTATTAA AAGATAGTCA AGTTAGTTAT
CATCAATTCT TCGCGGATAT GACTCTTACT TTTTCAAGTC AATGGCGAGA TGAGCCAGCT
TTTGTGATGA ATGATTCTGA AATTTTTCCA GCTTTAGGAG CATCTGCAGT TTTCCATAAT
TGGTGTGTAC TTTATCATAA AATTCTCAAT AACTTTGACC CTGAAAAAAT GGTAATAATT
GCAAAAAATT TAACTAAATA TAATCCACGA GCTAATTTAT TAAGACCTGT AATTGAATCA
ATTTGGGAAC CGATAATAGA AGCAGATAAT TGGCAACCTT TCTATGATTT AGTTCAACAG
TTTCAGTAA
 
Protein sequence
MTLAETRNYK DSSNPLLSLN YESALESLGN DYYDEVAAEE FPQHILRWRN NALLPRLGLD 
PQLVKNEDFI TAFGKFQQRK PFLALRYHGY QFGEYNPHLG DGRGFLYGQI RGSDHELYDF
STKGSGRTPY SRAGDGMLTL KGGVREVLAA EALNRLGVRT SRCLSMIETG LGLWRGDEPS
PTRSSVMIRM SKSHIRFGTF ERLHYLKRPD LTKKLLDHVI EQYYQNLINE HDKYALFYAE
LVQRVAELVA QWMAAGFCHG VLNTDNMSIT GESFDYGPYA FIPTYDLYFT AAYFDYYRRY
CYGQQPSICH LNLEMLQEPL KAVIDIADLQ NGLSKFAEYY QVEYRNLILK KLGLGNLHFV
EADDLLELTI TLLKDSQVSY HQFFADMTLT FSSQWRDEPA FVMNDSEIFP ALGASAVFHN
WCVLYHKILN NFDPEKMVII AKNLTKYNPR ANLLRPVIES IWEPIIEADN WQPFYDLVQQ
FQ
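
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the pipeline that generated this record: it assumes the sequence is given on the coding strand, builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons), and stops at the first stop codon. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as
# the standard code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace,
    up to (not including) the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record
print(translate("ATGACACTGG CTGAAACTCG AAATTACAAA"))  # MTLAETRNYK
```

The output matches the first ten residues of the protein sequence, and the final nine bases of the gene (TTTCAGTAA) translate to the closing residues FQ followed by a stop.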