Gene Aazo_5049 details

Gene Information

Locus tag: Aazo_5049
Symbol:
ID: 9342858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5166320
End bp: 5167681
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: FAD-dependent pyridine nucleotide-disulfide oxidoreductase
Protein accession: YP_003723277
Protein GI: 298493100
COG category:
COG ID:
TIGRFAM ID:
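
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5166320, 5167681)
print(length, protein_length(length))  # prints "1362 453"
```

Both values match the Gene Length and Protein Length fields of the record.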


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGTTT CACCTCAAAA AAAGCCATCA CATGAGGTTG TCATCATTGG TGGTGGTTTT 
GGTGGACTGT ATGCAGCAAA GGCTCTTGCT AACACAAATG TAAATGTTAC TCTCATTGAT
AAACGTAACT TTCACCTATT TCAGCCGCTT TTATATCAAG TTGCCACAGG TACGCTATCA
CCTGCTGATA TTTCTGCACC ATTGCGTTCT GTATTTAGCA AAAGCAAGAA TACAAAAGTG
CTGCTGGGAG AAGTAAATAA TATTGATCCA AAAGCGCAAA AAGTTATTAT GGGTGATGAA
ATAATACCCT ATGATACATT AATTGTGGCT ACAGGTGCTA ACCATTCCTA TTTTGGTAAG
GATAACTGGA GAGAATTTGC TCCTGGCTTG AAAACTGTGG AAGATGCGAT AGAAATGCGT
CGCCGGATAT TTTCAGCATT TGAAGGGGCA GAAAAAGAAA CGGATCCCGT AAAAAGTCGT
GCTTTTTTGA CTTTTGTGCT TGTGGGGGGT GGTCCGACTG GTGTAGAATT AGCAGGTGCG
ATCGCAGAGT TGGCATACAA AACTCTACAA GAAGATTTCC GCAACATTAA CACTTCAGAA
ACGAGAGTTT TACTATTGCA AGGGGGCGAT CGCATTCTCC CACACATTGC ACCAGAGTTA
TCCCAAGCAG CCGCAGCAGC CTTGCAAAAG TTGGGAGTGG TTATCCACAC TAATACCAGG
GTGACAAATA TTGAAAATGA CATTGTTACT TTCAAGCAAG ATGGTGAATT GATAGAAATT
GCTTCAAAAA CTATCTTGTG GGCAGCAGGT GTTCAGGGTT CGGCACTGGG GAGAATTTTA
GCAGAACGTA CAGATGTAGA ATGTGATCAC GCTGGGCGTG TAATTGTAGA ACCGAATTTG
ACTATCAAGG GTTATAAAAA CATTTTCGTA ATTGGAGATT TAGCCAACTT CTCCCATCAA
AATGGGAAAC CCTTACCTGG TGTTGCACCC GTAGCCAAAC AACAAGGAGA GTATGTAGGT
GGACTGATTC AACTACGGCT TCAAGGTCAT ACTTTGCCAG AATTTCATTA CACCGACGTG
GGTAGTTTGG CAATGATTGG GCAAAATTTA GCTGTTGTAG ATTTAGGCTT CATCAAACTC
ACTGGTTTCC TTGCTTGGGT ATTTTGGCTA GTAATTCACA TCTACTTCTT AATCGAGTTT
GATACTAAAT TAGTAGTAGT AATTCAGTGG GCGTGGAATT ATATCACTCG TAATCGTCGC
TCTCGATTGA TTACAGGTAA AGAAGCTTTT TTAGATCCAC AACCTGTTAA CAGTAGCAAT
AATTCCCAGA CTACAGAAAA GAAGCAAGCA GTCAAGCTCT AG
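
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as GC content. Below is a minimal sketch, assuming the sequence contains only A, C, G and T. The helper names are hypothetical, and only the first line of the sequence is used as input, so the result need not equal the whole-gene GC content given in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "ATGGTTGTTT CACCTCAAAA AAAGCCATCA CATGAGGTTG TCATCATTGG TGGTGGTTTT"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence to the same two functions gives the whole-gene value.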
 
Protein sequence
MVVSPQKKPS HEVVIIGGGF GGLYAAKALA NTNVNVTLID KRNFHLFQPL LYQVATGTLS 
PADISAPLRS VFSKSKNTKV LLGEVNNIDP KAQKVIMGDE IIPYDTLIVA TGANHSYFGK
DNWREFAPGL KTVEDAIEMR RRIFSAFEGA EKETDPVKSR AFLTFVLVGG GPTGVELAGA
IAELAYKTLQ EDFRNINTSE TRVLLLQGGD RILPHIAPEL SQAAAAALQK LGVVIHTNTR
VTNIENDIVT FKQDGELIEI ASKTILWAAG VQGSALGRIL AERTDVECDH AGRVIVEPNL
TIKGYKNIFV IGDLANFSHQ NGKPLPGVAP VAKQQGEYVG GLIQLRLQGH TLPEFHYTDV
GSLAMIGQNL AVVDLGFIKL TGFLAWVFWL VIHIYFLIEF DTKLVVVIQW AWNYITRNRR
SRLITGKEAF LDPQPVNSSN NSQTTEKKQA VKL
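
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGGTTGTTT CACCTCAAAA A"))  # first 21 bases -> MVVSPQK
print(translate("GTCAAGCTCT AG"))            # last 12 bases  -> VKL*
```

The two outputs agree with the start (MVVSPQK...) and end (...VKL) of the protein sequence, with the final TAG read as the stop codon.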