Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_5176 |
Symbol | |
ID | 9342983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 5300054 |
End bp | 5301358 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | family 2 glycosyl transferase |
Protein accession | YP_003723349 |
Protein GI | 298493172 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.634801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCA GTCTGTGCAT GATTGTCAAA AACGAGGAAA CCAACCTACC AAAATGCTTG CAAAGTGTCG AAGATGTGGT AGATGAAATT GTAGTCCTCG ATACAGGTTC AAGTGATCAA ACAATCCAAA TCGCTGAACA ATTCGGCGCT AAGGTGCATT ATTTTGAATG GTGTAATAAT TTTAGTACGG CTCGTAATGA AGCTTTAAAA TATGTTACAC GAGACTGGAT CTTAGTGTTA GATGCTGATG AAAGTCTAAC ACCAGAAATA GCGCCCTATT TGCAAGAAGC AATTAATATC CAAGATTATT TATTAATCAA TCTCGTCCGT CAGGAAATTG GTGCGACTCA ATCACCGTAT TCTCTGGTTT CTCGACTATT TCGCAACCAT GCCAAGATTA AATTTGATCG TCCATATCAT GCGTTGGTTG ATGATAGTAT TGCAGCAATT TCAACTAAAG AGACTTATTG GCAAATTGGC TATTTACCAG AGGTAGCTAT TCTTCATGCT GGATATCAAA AAGCTATAAT TAGTCAGCAG CACAAATATG GTAAAGCCGC AGCCGCAATG GAGGAATTTT TTGCTGCAAA TCCTGATGAT GTTTATGTTT GCAGTAAGTT GGGTGCTTTG TATGTAGAAA TGGGGAAAAT TAATGAGGGA ATGGAATTAT TAAATCAGGG ATTAAGTCAG ATGATTGGTA ATCAATTAAA CCAGTCAAAT AATCAGGTTC ACAAAGATAA AATCCGTTTA AGGGGGTTTC AAAATTCTCA ATCAAGAAAA GTTGGAAATT CTCTTAGTCA AGATATTAAG GAAACCAATT ATGATATTTT GTATGAATTA CATTATCATT TAGGAATTGC TCATACACAT TTTAAAAATT TCAACCAGGC AATTTCCCAT TATCAAGCTG CTGTAAAGTT ACCGATTTAT CCTCTTTTAA AGTTGGGAGG ATATAATAAT TTAGGTAATT TGTTGAAGGT ATCAGCTGAT TTTCTAGGGG CAAAAAATGC TTATGAAACG GCTATCAAAA TTGATCCTAG TTTTGTGACT GGTTATTATA ATTTGGGGAT GGTATGTAAA GCTATGGGTT CGTTGGTTGA AGCCATTGAT TGTTATGACA AGGCTATTCA ATTAAATCCT GATTATGCAG AAGCTTATCA AAATTTGGGA GTAGTGCTAC TGAAAGCCGG TGATGTCGAA ACTAGTTTAG CAGCGTTTGA ATATGCGATC GCACTCCATG AAAAAAATAA TCCCCAGGAA GCACAACGTC TCCGTCAAGG GTTGCAAGAC ATGGGATTGA AATAA
|
Protein sequence | MKLSLCMIVK NEETNLPKCL QSVEDVVDEI VVLDTGSSDQ TIQIAEQFGA KVHYFEWCNN FSTARNEALK YVTRDWILVL DADESLTPEI APYLQEAINI QDYLLINLVR QEIGATQSPY SLVSRLFRNH AKIKFDRPYH ALVDDSIAAI STKETYWQIG YLPEVAILHA GYQKAIISQQ HKYGKAAAAM EEFFAANPDD VYVCSKLGAL YVEMGKINEG MELLNQGLSQ MIGNQLNQSN NQVHKDKIRL RGFQNSQSRK VGNSLSQDIK ETNYDILYEL HYHLGIAHTH FKNFNQAISH YQAAVKLPIY PLLKLGGYNN LGNLLKVSAD FLGAKNAYET AIKIDPSFVT GYYNLGMVCK AMGSLVEAID CYDKAIQLNP DYAEAYQNLG VVLLKAGDVE TSLAAFEYAI ALHEKNNPQE AQRLRQGLQD MGLK
|
| |