Gene Aazo_1519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1519 
Symbol 
ID9339311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1591619 
End bp1594183 
Gene Length2565 bp 
Protein Length854 aa 
Translation table11 
GC content42% 
IMG OID 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_003720844 
Protein GI298490667 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00861946 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACTC CGACACCAGA ACACAATCAA CCAAATACAC CTCTTAGGGA TACGAAACTG 
GTAGAAGACC GCAGCAAGCT GAGTAAGATG TATCAGCACT ATGTAGAAAT GAAGGATAAA
TATCCTCATG CGTTGCTACT ATATCGCGTC GGTGATTTTT TTGAAACATT TTTCCAAGAC
GCTGTAACCG TATCCAGAGA ATTAGAATTA GTTTTAACCA GTAAACATGG TGGTGAGCTT
GGTCGTATTG CTATGACTGG TGTACCTCAT CACGCTTGGG AACGCTACAC AACTCTACTC
GTAGAAAAAG GTTACGCTGT GGTGATTTGT GACCAAGTTG AAGATTCATC GGAAGCAGTG
GGTTTGGTAA AACGCGAAGT AACCCGCATC CTTACCCCTG GGACTTTGTT GGAAGAAGGA
ATGCTACAAA CAAGTCGCAA TAATTATTTG GCGGCTGTGG TAATTGCTGT CAATCATTGG
GGGTTGGCTT ATGCAGATAT ATCAACTGGA GAATTTCTCA CCTGTGAAGG TAGTGATTTA
GAACACTTGA CCCAAGAGTT AATGCGATTG CAACCATCGG AAGTGTTGTT TCCTACCAAC
GCCCCCGATT TAGGTACTTT ACTGCGTCCA GGGGAAACTT CGCCATCTCT TCCCCAATGT
TTACCACCTA CATTTTGTTA CAGTTTGCGA TCGCAACTTC CCTTTTCTCA ATCCGAAGCT
AGAAGTAAAT TATTGCAGAA ATTCAAACTG CGATCGCTCG AAGGCTTAGG TTGTGAACAT
CTTCCCCTCG CAGTTCGCGC TGCTGGTGGA CTTTTGGAAT ATATCGAAGA TACCCAAAAA
CAAAACCCCG TTCCTCTGCA ACTATTACAC ACCTACAGCC TAACCGATTA TCTTATCGTT
GACCACCAAA CCCGACGTAA CTTAGAAATT ACCCAAACCG TGCGCGATGG CACATTTCAC
GGTTCTTTGT TATGGGCTTT AAACCGCACT ACTACCGCCA TGGGTGGACG AGCCTTAAGA
AGATGGTTGT TACAACCGCT ACTTGATATT AAAGGCATTC GAGCGCGACA ATATACCATT
CAAGAATTAT GTGAAAATAC TCCTTTACGT CAGGATTTAC GGAGATTATT ACGGAAGATC
TATGATTTAG AACGTTTAAC AGGTCGTGCA GGTTCAGGAA CAGCCAATGC ACGGGATTTA
ATGGCTTTAG CTGATTCTTT CTCTAGTTTA CCTGAATTAT CTCGTATAGT AGAAGATGCG
CGCTCGCCAT TCTTGAAAGC TTTGCAAAAA GTACCACCTG TACTAGAGGA ACTAGCAGAA
AGGTTACAGG CTCACATCGT AGAATCACCA CCAATCCATC TCAAAGAAGG TAGTTTGATT
CGCCCCGGGA TCAACCCGCT TTTAGATGAA AGAAAAGCTA CTGTAGAAGC GGATCAAAAA
TGGATTGCAA ATCTAGAAGT TGATGAAAGA GCGAGAACAA GAATCCCGAC TTTAAAGGTA
GGATTTAATA AAACCTTTGG CTACTATATT AGTATTTCTC GTTCCAAATC TGACCAAGTA
CCTGATAATT ACATCCGTAA GCAAACTCTG ACGAACGAGG AACGATACAT TACTCCAGAC
CTGAAAGAAC GAGAAGCGCG GATTCTCACA GCGCGAGATG ACTTAAATCA GTTGGAATAT
GAGATTTTCG CTGCTTTGCG GGATGAAGTC GGTTCTCACG CGGAAACCAT TCGTAATATT
TCCCATGCGG TAGCTGCTGC TGATGTATTA TGCGGATTAG CTGAGTTAGC GGTACATCAA
GGTTACTGTT GTCCGGAAAT GGTGGAAGGA TGGGAAATTG AGGTTATTGA TGGTCGTCAT
CCAGTAGTGG AACAATCTTT ACCAGCGGGG TTTTTTGTTC CTAACTCGAC AACATTGGGG
ACTGGGAATG AACCTACTAA TCACCAATCA CCTGATTTAG TCATTCTCAC AGGGCCGAAT
GCGAGTGGCA AGAGTTGTTA TTTACGTCAA GTGGGGTTAA TTCAGTTGAT GGCGCAGGTT
GGTAGTTTTG TGCCAGCGCG TTCTGCTAAG TTGGGAGTGT GCGATCGCAT TTTTACCCGT
GTAGGTGCTG TAGACGATTT AGCAACAGGT CAATCTACAT TTATGGTAGA AATGAATGAA
ACTGCAAATA TTCTCAATCA TGCAACTGTT AAATCATTAG TTTTATTAGA TGAAATTGGC
CGTGGAACAG CAACATTTGA TGGTCTTTCA ATAGCTTGGG CTGTAGCAGA ATATTTAGCA
GTAGAGATTA AATCTCGGAC AATTTTTGCA ACTCACTATC ATGAATTAAA TGAATTAGCG
AGTATTGTTT CTAATGTGGC TAATTATCAA GTTACCGTGA AAGAATTACC TGATCAAATT
ATCTTTTTAC ATCAAGTTCA GCCAGGTGGT GCTGATAAAT CCTATGGTAT TGAAGCAGGA
AGATTAGCAG GTTTACCCAC GGTGGTAATC AAGCGTGCAA AACAAGTAAT GGGACAAATT
GAAAAACATA GTAAAATTGC TGTGGGTTTG CGTGAAGGAC TGTAA
 
Protein sequence
MTTPTPEHNQ PNTPLRDTKL VEDRSKLSKM YQHYVEMKDK YPHALLLYRV GDFFETFFQD 
AVTVSRELEL VLTSKHGGEL GRIAMTGVPH HAWERYTTLL VEKGYAVVIC DQVEDSSEAV
GLVKREVTRI LTPGTLLEEG MLQTSRNNYL AAVVIAVNHW GLAYADISTG EFLTCEGSDL
EHLTQELMRL QPSEVLFPTN APDLGTLLRP GETSPSLPQC LPPTFCYSLR SQLPFSQSEA
RSKLLQKFKL RSLEGLGCEH LPLAVRAAGG LLEYIEDTQK QNPVPLQLLH TYSLTDYLIV
DHQTRRNLEI TQTVRDGTFH GSLLWALNRT TTAMGGRALR RWLLQPLLDI KGIRARQYTI
QELCENTPLR QDLRRLLRKI YDLERLTGRA GSGTANARDL MALADSFSSL PELSRIVEDA
RSPFLKALQK VPPVLEELAE RLQAHIVESP PIHLKEGSLI RPGINPLLDE RKATVEADQK
WIANLEVDER ARTRIPTLKV GFNKTFGYYI SISRSKSDQV PDNYIRKQTL TNEERYITPD
LKEREARILT ARDDLNQLEY EIFAALRDEV GSHAETIRNI SHAVAAADVL CGLAELAVHQ
GYCCPEMVEG WEIEVIDGRH PVVEQSLPAG FFVPNSTTLG TGNEPTNHQS PDLVILTGPN
ASGKSCYLRQ VGLIQLMAQV GSFVPARSAK LGVCDRIFTR VGAVDDLATG QSTFMVEMNE
TANILNHATV KSLVLLDEIG RGTATFDGLS IAWAVAEYLA VEIKSRTIFA THYHELNELA
SIVSNVANYQ VTVKELPDQI IFLHQVQPGG ADKSYGIEAG RLAGLPTVVI KRAKQVMGQI
EKHSKIAVGL REGL