Gene Aazo_3647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3647 
Symbol 
ID9341452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3713517 
End bp3715244 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content40% 
IMG OID 
ProductDNA repair protein RecN 
Protein accessionYP_003722337 
Protein GI298492160 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCTGT GCCTCAGAAT TGAGAATTTT GCCCTCATCG ACCAACTAGA GTTAGACTTT 
GGCGCTGGGT TGAATGTACT AACAGGGGAA ACCGGCGCGG GAAAGTCGAT TATTTTGGAT
GCGATTGATG GGGTTTTAGG TGGGAAAGTC TCTAGTCGTG TGATTCGCAC GGGGACTAGT
CGCGCTTTAG TAGAAGGGAC TTTTAGTATC AATCCTTTTC TGGCTGCTTG GTTGAGTGAA
CAGGAAATTG ATTTAATTGA TGATAATGCT GTAGTTATTA GTCGAGAAAT TGCCGCAACT
GCCAGTAATA TCCGCAGTAG GTCGCGGGTA AATGGTGTGT TGGTAAATCG GCAAATAATG
GGAAGTTTGC GCGATCGCTT GGTGGAAATT ACTGCCCAAG GGCAAACTGT ACAAGTGGGA
CAATCTGCCC AAGTTAGAGA CTGGTTAGAT GTATATGGTG GTGATTCTTT AATACAACAA
CGGCAAAAGG TAGCTGTTGC TTTTAGTGCA TATCAACAAG CACACCAAAC TTCAGAAAAA
CGTCGCACTT CGGAAAGAGA ACGCTTACAA CAATTCGATT TAATTACCTA TCAAGTTCAA
GAATTGAGTG CAGCGAATCT CAACTATCCG CAAGAAATAG AACAACTAAC CCAGGAAATG
CAACGCCTAA ATCATGTTGT TGATTTACAA CAAATGAGTT ACAAAGTTTA TCAAGCTTTG
TACCAAAATG AAGATGAGAC TCCTACTGCT GCTGATTTAT TGGGAGATTG TGAAACAATA
TTAAATCATA TTGTTGAGTT TGATTCCCGA ATGGAATCTA TGTTGGAATT GGTGCGAGAT
GCGGTAGCAG CAGTAATGGA AGTGGGAAGA CAAATTAGCA TTTACGGAGA AAGTTTAGAA
GCTGATCCGC AGCGGTTAGA GGAAGTAGAA GAACGGATTC GGGAACTAAA ACAAATTTGT
CGCAAATATG GACCGACTCT TACGGAAGCG ATCACTTATT TTGAACGCAT CCAAATAGAG
TTAGCAGAAC TCAATAATAA TGAACAATCA ATTGAAACTT TAGAACAACA AGAACAGGTT
TGTTTACAAT ATCTCAATCA AGTCAACCAA CAATTGACCC AACTGCGTCG TAAAACTGCG
GCTAATTTAG AATCTCATTT ATTGACTGAA CTTAAACCTT TAGGGATGGA AAAGGTAAAA
TTTCAAGTGA AAATTGCCCC TAGTTCCCCA ACAGCAATGG GTGCAGATAA AATTACCTTT
ATGTTTAGCC CTAACCCTGG TGAACCAATA CAACCTTTAA CAGAAATTGC TTCTGGTGGG
GAAATGAGCC GATTTTTACT AGCTTTAAAA GCTTGTTTTA ATCAACATGA CGGTGCGGAA
ACAATGGTAT TTGATGAAAT TGATGTGGGT GTGTCTGGAA GAATTGCCCA AGCTATTGCT
GAGAAATTAC ACCAACTTAG TCAAAATCAA CAAGTATTAT GTGTGACTCA TCAACCCTTA
GTAGCAGCAA TGGCAGATCG ACATTTTCGG GTGGATAAAC AAGTGATTAA TAAAAATGGT
AATGCTGAAC AGCGGACAGT TGTGAGAGTT ACCAGCTTGG ATAATTTAAG TACCCGTCGG
GAAGAATTAG CACAGTTAGC CGGTGGTAAA TCTGCAAATC AAGCGATGGC ATTTGCTGAA
TCTTTATTAT TACAAGCAGC TAACCACCGT CGTCAAGAAC AAAGTTAA
 
Protein sequence
MLLCLRIENF ALIDQLELDF GAGLNVLTGE TGAGKSIILD AIDGVLGGKV SSRVIRTGTS 
RALVEGTFSI NPFLAAWLSE QEIDLIDDNA VVISREIAAT ASNIRSRSRV NGVLVNRQIM
GSLRDRLVEI TAQGQTVQVG QSAQVRDWLD VYGGDSLIQQ RQKVAVAFSA YQQAHQTSEK
RRTSERERLQ QFDLITYQVQ ELSAANLNYP QEIEQLTQEM QRLNHVVDLQ QMSYKVYQAL
YQNEDETPTA ADLLGDCETI LNHIVEFDSR MESMLELVRD AVAAVMEVGR QISIYGESLE
ADPQRLEEVE ERIRELKQIC RKYGPTLTEA ITYFERIQIE LAELNNNEQS IETLEQQEQV
CLQYLNQVNQ QLTQLRRKTA ANLESHLLTE LKPLGMEKVK FQVKIAPSSP TAMGADKITF
MFSPNPGEPI QPLTEIASGG EMSRFLLALK ACFNQHDGAE TMVFDEIDVG VSGRIAQAIA
EKLHQLSQNQ QVLCVTHQPL VAAMADRHFR VDKQVINKNG NAEQRTVVRV TSLDNLSTRR
EELAQLAGGK SANQAMAFAE SLLLQAANHR RQEQS