Gene Aazo_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4678 
Symbol 
ID9342485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4783497 
End bp4785128 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content39% 
IMG OID 
Productradical SAM domain-containing protein 
Protein accessionYP_003723014 
Protein GI298492837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.863222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACATCAT CTGTATTTAA TTCTGAACGC CTTTTATTTA CACCCGCTAC TCCCCAAACT 
GATGCTATTC CTGTGATTTT TGCTTTCCCC AATGAGTACA CGGTGGGAAT AACCAGTCTT
GGCTATCAGG TGGTATGGGC GACTTTGGCG ATGCGTGATG ATTTGCAGGT GAGTCGCTTA
TTTACTGATA TTCATGAACA ACTTCCTAGA CAACCGGAAA TACTTGGTTT TTCCATGTCT
TGGGAATTAG ATTATGTGAA TATTTTCAAT CTTTTGGAAT ATTTACAAAT TCCTCTTCGT
GCCAGTTCTC GCACTGCAAA TCATCCTCTA GTTTTTGGTG GCGGTCCTGT TCTAACTGCT
AATCCTGAAC CTTTTGCTGA TTTTTTTGAT GTGATTTTAT TGGGCGATGG AGAAAATCTG
CTGGGTGATT TTATTGATGC TTATAAAGAA GTTAGAAATG CTGATAAAGA AGCTCAATTA
AAAAAAATAG TACAAGTTCC AGGCATTTAT ATTCCGAGTT TATATTATAT TAAATATTCT
AGTTCGGATG GTGAAATATT AGCAATTAAT CCAATTTATT CAGAAGTTCC CGCAGTTATT
CAAAAGCAAA CTTACCGGGG AAACACTCTT TCTGTTTCGA CTGCGGTGAC AGAAAAAGCT
GCCTGGGAAA ATATTTTCAT GGTGGAAGTG GTGCGGAGTT GTCCTGAAAT GTGCCGCTTC
TGTTTAGCAA GTTATTTAAC TTTACCGTTT AGAACTGCGA ATTTAGAAAA TTCTTTAATT
CCCGCTATTC AACAAGGTTT AAAGGTTACT AACCGTCTGG GTTTATTGGG TGCTTCGGTA
ACTCAACATC CTGAATTTCC AGAGTTACTA GATTATATTA GTCAACCAAA ACATGATGAT
GTGCGGTTGA GTGTTGCTTC TGTTAGAACG AATACGGTAA CGGAAAAGTT AGCTACAACT
TTGGCGAAAC GGGATACGCG ATCGCTTACC ATTGCGATAG AAAGTGGTTC AGACAAATTA
CGCCAAATCA TCAATAAAAA ATTATATAAC GATGAAATTA TCCAAGCTGC GGTGAACGCG
AAAGCTGGTG GTTTATCGAG TTTGAAATTA TACGGAATGG TGGGCATTCC TGGAGAAGAA
ACGGAAGATT TAGATGCAAC GGTAGCAGTG ATGAAGGCTG TTAAAAAAGC TGCTCCTGGT
TTGCGGTTAA CTCTGGGATG CAGCACTTTT GTTCCAAAGT CCCACACACC GTTTCAGTGG
TTTGCGGTGA ATAAACAATC TGAGAAGCGG TTACAGTTTT TACAGAAACA GCTAAAACCC
CAAGGTATAG ATTTTCGTCC TGAAAGTTAT AATTGGTCTA TTATACAGGC TTTGTTATCG
AGAGGTGATC GCAGGCTTTC CTATCTGTTA GAACTAACTC GTGATTTTGG TGACTCATTG
GGCAGTTACA AACGCGCTTT CAAGGAATTA AAGGGAAAAA TTCCCGACTT AGATTATTAC
GTTTATAGTA ATTGGTCAAC TGAGCAAATT TTACCTTGGA ACCACTTGCA AGGTCCCTTA
CCTCAGTCTA CGCTAATAAA GCATTTGGCT GAAGCAATGA GTCATTTTCG ATCTGATTCA
CAAGTGAAAT AA
 
Protein sequence
MTSSVFNSER LLFTPATPQT DAIPVIFAFP NEYTVGITSL GYQVVWATLA MRDDLQVSRL 
FTDIHEQLPR QPEILGFSMS WELDYVNIFN LLEYLQIPLR ASSRTANHPL VFGGGPVLTA
NPEPFADFFD VILLGDGENL LGDFIDAYKE VRNADKEAQL KKIVQVPGIY IPSLYYIKYS
SSDGEILAIN PIYSEVPAVI QKQTYRGNTL SVSTAVTEKA AWENIFMVEV VRSCPEMCRF
CLASYLTLPF RTANLENSLI PAIQQGLKVT NRLGLLGASV TQHPEFPELL DYISQPKHDD
VRLSVASVRT NTVTEKLATT LAKRDTRSLT IAIESGSDKL RQIINKKLYN DEIIQAAVNA
KAGGLSSLKL YGMVGIPGEE TEDLDATVAV MKAVKKAAPG LRLTLGCSTF VPKSHTPFQW
FAVNKQSEKR LQFLQKQLKP QGIDFRPESY NWSIIQALLS RGDRRLSYLL ELTRDFGDSL
GSYKRAFKEL KGKIPDLDYY VYSNWSTEQI LPWNHLQGPL PQSTLIKHLA EAMSHFRSDS
QVK