Gene Aazo_3736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3736 
Symbol 
ID9341541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3793251 
End bp3794555 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content34% 
IMG OID 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_003722402 
Protein GI298492225 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.730052 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAGAC AACGGCCAAT TGCAGTTGAC TTATTTGCAG GTGCAGGAGG TATGACCCTT 
GGCTTTGAAC AAGCAGGTTT TGATGTACTT GTATCTGTAG AACTAGACCC AATTCATTGT
GCAATTCATA AATTTAACTT TCCCTTTTGG AAAGTCTTAT GTAAAAGTGT AGAAGAAACA
ACAGGGTCAG AAATTAGAAA TAGTTCTGAC ATTGGTAATC AAGAAATTGA TGTAGTGTTT
GGCGGTCCAC CATGTCAAGG CTTTTCATTA ATTGGTAAAC GTTCTATCGA TGACCCTAGA
AATACTCTAG GTTTCCATTT TATTAGGTTG GTTTTGGAAC TACAACCTAA TTTTTTTGTT
TTGGAAAATG TTAAAGGAAT GACCGTAGGT AAACACAAAG AATTTACTAC GGAAATAATT
GATAAGTTTG AAAATAATGG TTATAAAGTA AATCGAAATT ATCAATTATT AAATGCTGCT
AATTATGGAG TACCACAAAA TCGAGAAAGA TTATTTTTAT TAGGTTGTCG TCAAGATTTA
AAATTACCAA ATCATCCAGA TAAAATTACC CATCCTGCCA AATCTAATAA CTCTATAGCT
TGCACCACAA TTGCACTATC AAAATTACCA TCAACACCTA CAGTTTTGCA AGCTATTCAA
GACCTACCGG AAATAGAAAA TTATCCAGAA TTATATCAAC AGGATTGGGT AGTAACTGAT
TTTGGAAAAC CTAGTAATTA CGGGAAAAAA ATGCGTCATC CTAGCCTATC CAAAAATAAT
TATTCCTATC AGCGTAAGTT TAATCATAAC ATTCTAACAT CCAGTTTAAG AACAAAACAT
AATCCCGAAT CCATCGAAAG ATTTGCATTA ACTCCCTATG GAAAAATCGA ACCAATCAGC
CGTTTTTATA AACTAGCTCC TGATGGCTTA TGTAACACAC TCAGAGCAGG AACAGCAAGT
AATAAAGGTG CATTTACTTC CCCTCGTCCC ATACATCCTT TTAAACCTAG ATGTATTACT
GTTAGAGAAG CTGCACGGTT GCATTCTTAT CCTGATTGGT TTAGATTTCA CCCTACAAAA
TGGCATGGTT TTAGACAAAT CGGTAACTCT GTTCCTCCAC TTTTAGCTCA AGCTGTAGCA
TCAGAAATTA TTAAAGTGTT AGGTATAAAA TCTTCCCAAC TTAAGTTTGG TAAAGATTTA
AAAGATTTAG GAGAAACCAG GTTATTAACA TTTGATATGT CAGAAGCTGC TAAATATTTT
GATGTTAATC CTGATGTTAT AGAACCTAGA ATTAGAAAGA AATGA
 
Protein sequence
MFRQRPIAVD LFAGAGGMTL GFEQAGFDVL VSVELDPIHC AIHKFNFPFW KVLCKSVEET 
TGSEIRNSSD IGNQEIDVVF GGPPCQGFSL IGKRSIDDPR NTLGFHFIRL VLELQPNFFV
LENVKGMTVG KHKEFTTEII DKFENNGYKV NRNYQLLNAA NYGVPQNRER LFLLGCRQDL
KLPNHPDKIT HPAKSNNSIA CTTIALSKLP STPTVLQAIQ DLPEIENYPE LYQQDWVVTD
FGKPSNYGKK MRHPSLSKNN YSYQRKFNHN ILTSSLRTKH NPESIERFAL TPYGKIEPIS
RFYKLAPDGL CNTLRAGTAS NKGAFTSPRP IHPFKPRCIT VREAARLHSY PDWFRFHPTK
WHGFRQIGNS VPPLLAQAVA SEIIKVLGIK SSQLKFGKDL KDLGETRLLT FDMSEAAKYF
DVNPDVIEPR IRKK