Gene Aazo_3882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3882 
Symbol 
ID9341686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3935372 
End bp3936604 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content41% 
IMG OID 
Productmonooxygenase FAD-binding protein 
Protein accessionYP_003722513 
Protein GI298492336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACAA ATCACAATCT TCCTTTAGAA CACGAGATTT TAGATGTACA AAATACAGAT 
TGTTGTATTG TTGGTAGTGG TCCTGCGGGG GCTGTACTGT CTCTACTTTT AGCACGTCAA
GGAATTTCGG TGTATCTGCT AGAAACACAC AAAGATTTTG ACCGGAATTT TCGTGGTGAT
ACTATTCATC CGGCGATAAT GCAAATTATG GAAGAGTTGG GTTTAAGTAA TAGCCTTTTG
CAATTACCCC ACACCAAAAT GCACCGTATT CAAATGAAAA CTTCCCAAGA AACCATCACT
TTTGCAGATT TTAGTCGCCT GAAAACCGCC TACCCTTATA TTATGATGCT TCCCCAGGCG
CGGTTTTTAG AATTTATCAC CCAAGAAGCA AAAAAATACC CTAATTTCCA TTTAATTTTG
GGTGCAAATG TGCAGGAATT AATTACAGAA AATGGTATAA TTCAAGGTGT GCGTTATCGT
GGACAAGGCG GTTGGCATGA AGTACGCGCT ACATTAACAA TAGCTGCAGA TGGTCGTCAC
TCAAAACTAA GACAATTGGG TGATTTTGAA TCTGTAGAAA CTTCGCCACC GATGGATGTG
CTTTGGTTTC GTCTCCCGCG TCAGCGAGAA GAATTTGCAG GGGGTATGGC GCTCTTTGGT
TCTGGTAACA TCTTGGCTAT GCTAGATCGT GGTGATGAGT GGCAAATTGG CTATGTTATC
CCCAAAGGTA GTTATCAACA ACTGCGTGCT ACTGGTTTAG CAGAATTAAA AAAATCAATT
ATTGATACAG TACCAGAATT AAGTGATCGC ATCGAACTGT TACAAGATTG GTCGAAAATA
GCTTTTCTTT CTGTCGAATC AAGTCGCGTT AAGCGTTGGT ATCGTCCCGG ACTGTTACTC
ATAGGTGATG CAGCCCATGT CATGTCACCA GTGGGTGGTG TCGGTATAAA TTACGCAATT
CAAGATGCTG TCGTCACCGC AAATATATTG AGTCAACCAC TTAAACAACA TCGTGTAGAA
ATCAGCGATT TAGCAAAAGT ACAGCAGCAC AGAGAGCTAC CTACACGAAT CATTCAAGCA
TTTCAGGCTT TGATCCAAAA GCGGATATTT ACCCCCATTC TTACTGAAAA TCAAACTTTT
CAACCGCCTC TATTGCTACG TTTACCTATC TTACGTGATA TCCCCGCTCG ATTAATTGCT
CTGGGTATTT TTCCTGTTCA TGTCCAGATT TAA
 
Protein sequence
MSTNHNLPLE HEILDVQNTD CCIVGSGPAG AVLSLLLARQ GISVYLLETH KDFDRNFRGD 
TIHPAIMQIM EELGLSNSLL QLPHTKMHRI QMKTSQETIT FADFSRLKTA YPYIMMLPQA
RFLEFITQEA KKYPNFHLIL GANVQELITE NGIIQGVRYR GQGGWHEVRA TLTIAADGRH
SKLRQLGDFE SVETSPPMDV LWFRLPRQRE EFAGGMALFG SGNILAMLDR GDEWQIGYVI
PKGSYQQLRA TGLAELKKSI IDTVPELSDR IELLQDWSKI AFLSVESSRV KRWYRPGLLL
IGDAAHVMSP VGGVGINYAI QDAVVTANIL SQPLKQHRVE ISDLAKVQQH RELPTRIIQA
FQALIQKRIF TPILTENQTF QPPLLLRLPI LRDIPARLIA LGIFPVHVQI