Gene Aazo_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1946 
Symbol 
ID9339739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2026860 
End bp2028431 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content38% 
IMG OID 
Productradical SAM domain-containing protein 
Protein accessionYP_003721157 
Protein GI298490980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGCTT TACTACTCTG GCCGATCATG CCTAATTCTT TCTGGTCTTA TCAGGAAACC 
CTTGCTTTGG CTGGGTTACG TGCGACAAAT CCCCCACTAG GTTTAATCAC AGTAGCAGCG
ATGTTACCGA GTGATTGGGA AATTAGATTG TCCGATCGCA ATGTCCGTCT AGAAACAGAT
GCAGATTGGG AATGGTGCAA TATTGTCATC CTCTCTGCAA TGATTATCCA AAAACAAGAT
TTTGGTGAAT TAATTCAAAA AGGTAAAAGG TTAGGTAAAA AAGTCGCAGT CGGTGGACCT
TTTGCTACAT CTGTACCAGA ATTTGTCTTA GAAGCAGGAG CAGATTATTT AATTTTAGAT
GAAGGAGAAA TCACCATCCC GATGTTTTTA GAGGCTTTAG AAAAGGGAGA AGAAAAAGGT
ATTTTCCGAG CTACAGAAAA ACCAGATGTT ACCCAAACTC CCTTACCTAG ATTTGATTTA
TTAGACCTAA ATGCTTACAT AGCTATGACC GTACAGTTTT CACGGGGTTG TCCATTTCAA
TGTGAGTTCT GTGATATTAT CACCCTTTTT GGACGCAAAC CCCGCACGAA AACACCAGAA
CAGATCTTAG TAGAATTGGA AGTATTATAT CAGATGGGTT GGTGGCGTTA TGTATTTATT
GTTGATGATA ACTTTATCGG CAATAAACGT AATGCTAAAA TCTTTTTAAG GGAACTAATT
CCCTGGATGG AAAAACGAAA TTATCCTTTT GCTTTACTCA CCGAGGCTTC TTTGAATTTA
GCAGAAGATG ATGAATTATT AGAATTAATG GTAAAAGCTG GTTTTGTTCA GGTATTCATG
GGTATTGAAA CTCCTGATGT AGAAAGTTTA GTAGGAGCAA ATAAAGAACA AAATACCCGT
AAGTCTTTAG TGGAGTCCTG CCACAAAATT ACCAAAGCCG GACTACAAAT TATGTCTGGT
TTTATCTTAG GATTCGATCA TGAAAAACCT GGTGCAGGTA AACGTATTCA AGAGTTTGTG
GAAGAAACTA ATATTCCCCA AGCCCATCTT AATTTATTGC AAGCATTACC AAATACAGCC
ATGTGGAATC GGCTGCAAAA AGAAGGAAGG TTAATAGATG CGTTAGGTGA ATTTCTAGGT
TCTCAAAAAT CTTTAATTAA CTTTGTTCCT ACTCGTCCCA TGACAGAAAT AGCTGACGAG
TTTATCGAAA CTTTTTGGAA TTTGTATGAA CCCATACCTT ACCTCAAACG TACTTTTCGT
CATTTTATGA TGATGGAGGG TTGGCGGGCT AAATATCAAC GGACCTTAAC AAAAGCAGAG
TGGGACTTTT TAGGGGCTAT TTGTTGGAGA CAGGGAATAT TGCGTTCTAC AAGATTTCAT
TTTTGGTGGC AATTAATAGT TATGGCATGG CATAAACCAA ATTTATTATA TGACTATTTA
ATCGCCTTGG GTGTGGGTGA ACATTTTTTC AGTTTTCGTC ATGAGGTAAA AGTAGAAATA
GAATCAGAAT TAGCATTATT ACAGCAGCAA GAATTCGATA AAAAATCAGC AACGTTAAGT
TATAAACCGT GA
 
Protein sequence
MRALLLWPIM PNSFWSYQET LALAGLRATN PPLGLITVAA MLPSDWEIRL SDRNVRLETD 
ADWEWCNIVI LSAMIIQKQD FGELIQKGKR LGKKVAVGGP FATSVPEFVL EAGADYLILD
EGEITIPMFL EALEKGEEKG IFRATEKPDV TQTPLPRFDL LDLNAYIAMT VQFSRGCPFQ
CEFCDIITLF GRKPRTKTPE QILVELEVLY QMGWWRYVFI VDDNFIGNKR NAKIFLRELI
PWMEKRNYPF ALLTEASLNL AEDDELLELM VKAGFVQVFM GIETPDVESL VGANKEQNTR
KSLVESCHKI TKAGLQIMSG FILGFDHEKP GAGKRIQEFV EETNIPQAHL NLLQALPNTA
MWNRLQKEGR LIDALGEFLG SQKSLINFVP TRPMTEIADE FIETFWNLYE PIPYLKRTFR
HFMMMEGWRA KYQRTLTKAE WDFLGAICWR QGILRSTRFH FWWQLIVMAW HKPNLLYDYL
IALGVGEHFF SFRHEVKVEI ESELALLQQQ EFDKKSATLS YKP