Gene Aazo_1540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1540 
Symbol 
ID9339332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1612612 
End bp1613748 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content39% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003720854 
Protein GI298490677 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.213513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATTA ATAAAGTGCC AAATAATCAT CTATTTGTGT TCTTAGAACT CTTTACAGTT 
GAAGGTGGAA TTCAATCCTA TATTAAGGAT ATTTTTCGTG TCTATCAGGG ATTAAATCAA
ACTTGTAAAG CCGAAGTTTT CTTGTTGCGA GATAGTCCTG CTTGTATTAA TATTTTCGCA
TCAGTAAATT TAAAGTTTCA TTATTTTAAA AGCAAATCGT CCAAAATAGG TAGACTCAAA
TTGGCGATCG CATTAGTTCG ATATCTTCTG CAAAAACGAC CGCAACAAGT TTTCTGTGGT
CATATTAAGT TAGCAGGACT GATACAAATT TTATGTCAGC CCTTGAGCAT TTCCTATACA
GTACTCACTT ATGGTAAGGA GGTATGGGAA CCTCTAAATA ATACAGAACG ACGTGCTTTA
GCTTCAGCTT CAGCAATTTG GACAATTAGC CGTTATAGTC GAGATCGCGC TTGTGCTGCT
AATGGTATCG ACCCAAAAAA GGTACAAATG CTACCTTGTG CAATAGATGG AGAGAAGTTT
ACTCCTGGAG AAAAAGCACT GGAATTAATC CAAAAGTATG GTTTAAATAA TGCCAAAGTA
TTAATGACAG TGGCGCGGTT GTGGTCTGGA GATATTTATA AAGGTGTGGA TGTAACAATT
AGAGCCTTAC CACAGATTAT CCAGGTGTTT CCAGAGGTAA AATATTTAGT GATTGGCCGT
GGTAATGACC AACCAAGATT AGCCCAGTTA GCAAAAGATT TAGGTGTGAG CGATCCCCTT
ATCTTTGCTG GTTTTATACC TACAGAAGCA TTAATGTTAC ACTATCGCCT AGCCGATGCC
TATATTATGC CCTCCCAAGA AGGGTTTGGT ATAGTTTACC TAGAAGCAAT GGCTTGTGGT
GTCCCAGTGT TATCTGGTGA TGATGATGGC TCGGCTGACC CATTACAAGA TGGTAAACTA
GGATGGAGAG TACAACACCG GAATCCTGAT GCAGTGGCAG CAGCTTGTAT TGAAATTCTT
CAAGGTCAGG ATCAACGTTG TGATGGTAAA TGGTTAAGAG AGCAAACGAT CGCTATTTTT
GGGATACAAG CTTTTCAACA ACGTTTACAG CAAATGCTCC AATCAACTAA TAACTAA
 
Protein sequence
MPINKVPNNH LFVFLELFTV EGGIQSYIKD IFRVYQGLNQ TCKAEVFLLR DSPACINIFA 
SVNLKFHYFK SKSSKIGRLK LAIALVRYLL QKRPQQVFCG HIKLAGLIQI LCQPLSISYT
VLTYGKEVWE PLNNTERRAL ASASAIWTIS RYSRDRACAA NGIDPKKVQM LPCAIDGEKF
TPGEKALELI QKYGLNNAKV LMTVARLWSG DIYKGVDVTI RALPQIIQVF PEVKYLVIGR
GNDQPRLAQL AKDLGVSDPL IFAGFIPTEA LMLHYRLADA YIMPSQEGFG IVYLEAMACG
VPVLSGDDDG SADPLQDGKL GWRVQHRNPD AVAAACIEIL QGQDQRCDGK WLREQTIAIF
GIQAFQQRLQ QMLQSTNN