Gene Aazo_5170 details


Gene Information

Locus tag: Aazo_5170
Symbol:
ID: 9342977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5294747
End bp: 5295970
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: molybdenum cofactor synthesis domain-containing protein
Protein accession: YP_003723345
Protein GI: 298493168
COG category:
COG ID:
TIGRFAM ID:
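
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 5294747
end_bp = 5295970

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1224
print(protein_length)  # 407
```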


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTCAG TCAGAGATGC AGAAGCTACT ATTTTCAATG CTATACAACC GCTGGATAAC 
CAGCAGGATA TAGAATTTGT CGATTTGTTG ATGGCAAATA ATCGTATTTT AGCCACTCCT
GTTACCAGTT CCTTCGATTT TCCCCATTGG GATAATTCGG CAATGGATGG TTATGCTGTG
CGTTATGCAG ATGTGCAGCA AGCAAGGGCT AATAAACCCA TTATTTTGAC AGTTGTGGAA
GAAATTCCCG CCGGATATCA ACCCCAAGTG ACTATTAAAC CAGGAGAAGC GGCGCGAATT
TTTACAGGTG CGGTGATGCC AACAGGTGCG GATACTGTTG TTATGCAGGA AAAGACTCAC
CAGGAAGAAA ACCGCATTTT TATCTTTGCT GCACCTCAAC TAGAAGAGTT TGTTAGACGC
AAGGGTGATT TTTACCAAGC TGGAAAGCAA CTGTTACCCG CAGGTATTAG TTTAAATGCT
TCTGAAATTG GGGTTTTAGC TGGGGCAGGA CGTGAGCAAG TCTGTGTTTT CCGTCGTCCC
CGTGTGGCGA TTCTTTCCAG TGGTAATGAG TTGGTGATGC CGGAAGAAAT GCTCAAACCT
GGGCAAATTG TTGATTCTAA TCAGTATGCT TTGGCTACTT TGGTAAGGGA ACTGGGTGCG
GAAGTGTTAC TGTTAGGAAT TGTTAAAGAT GATCCTACGG CTTTAAAAGA AATTATAGAT
TATGCGATCG CCAACGCTGA TATAGTTATT TCTACTGGTG GTGTATCTGT GGGCGATTAT
GACTACATAG ATAAGATTTT AGTGTCTCTG GGGGCAAAAG TTCACTTTAG CTCTGTGCAA
ATGCGTCCGG GAAAACCTCT GACTTTTGCA ACTTTCCCCA ATTCATTATA CTTTGGTTTA
CCTGGAAATC CTGTTTCTGG TTTGGTTACT TGCTGGCGGT TTGTACAACC AACAATTAAA
AAACTGGCGG GACTTTCTAA AGGTTGGGAA GGAAAATTTT TGAAAGTGCG ATCGCATTCA
GAATTACAAT CAAATGGTAA GATGGAAACT TATGTTTGGG GTAAGTTACA TCTGGTAAAT
GGTGGTTATC AATTTCACAA AGCCGAGGGT AATGATAGTT CGGGTAATTT AATTAATTTA
GCGCAAACAA ATGCTTTGGC TGTCTTACCT GTGGGTAAAA CTTTGGTTTA TTCTGGTGAG
GAAGTTTTCG TTTTGCAGCT ATAG
 
Protein sequence
MLSVRDAEAT IFNAIQPLDN QQDIEFVDLL MANNRILATP VTSSFDFPHW DNSAMDGYAV 
RYADVQQARA NKPIILTVVE EIPAGYQPQV TIKPGEAARI FTGAVMPTGA DTVVMQEKTH
QEENRIFIFA APQLEEFVRR KGDFYQAGKQ LLPAGISLNA SEIGVLAGAG REQVCVFRRP
RVAILSSGNE LVMPEEMLKP GQIVDSNQYA LATLVRELGA EVLLLGIVKD DPTALKEIID
YAIANADIVI STGGVSVGDY DYIDKILVSL GAKVHFSSVQ MRPGKPLTFA TFPNSLYFGL
PGNPVSGLVT CWRFVQPTIK KLAGLSKGWE GKFLKVRSHS ELQSNGKMET YVWGKLHLVN
GGYQFHKAEG NDSSGNLINL AQTNALAVLP VGKTLVYSGE EVFVLQL
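
The sequence blocks above are printed in space-separated groups of ten. The following is a rough sketch, not the database's own tooling, of how such a block could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence shown above.
first_block = "ATGCTGTCAG TCAGAGATGC AGAAGCTACT"
print(translate(first_block))  # MLSVRDAEAT
```

Passing the full gene sequence to `gc_percent` and `translate` gives values to compare against the GC content and protein sequence listed in this record.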