Gene Aazo_4919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4919 
Symbol 
ID9342726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5034939 
End bp5036099 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content38% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003723178 
Protein GI298493001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.568144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATCT TATTTGTTAG CAGCAGTTCT GGGTCACGAG GTGGAGGTGA ACTCTATTTA 
ATTTATCTGG GGCAAGAACT AGCAAATCGG GGCTATGCTG TGGGATTGTG GTGTTCTCAA
CATCCGATCA TGAATGAATT AGCAAGTTCA TTTGGTAGGT TTGGAGAAGT ATTGCGCTCG
CCATACCAAA ACACATACAG TCTTAAATTA AGATCCTTTA CCCATATATT CCCCCAAATA
AATCAAAAGA TTATTTCTCA GTGGCAAGCA TTTAAACCTG ATATTCTTCA TTTCAACAAG
CAGAATTTAG AAGATGGTTT AGATTTACTA TCCTGGAGTC ATTATTTATC AATTCCGTAT
TTGGTAACCA TTCATATTAC CCAAACGCAG GACAGCTTAG GGGCATTCTT GGGAAAGTGG
CGGGATTTAA TAGCCAAAAT CTCATTGAGA AAATATAGAG GTTCACTAGT AGCAATTTCC
GAACATCGAG GTAAAGAATT AACATCTTTC CTTGCTACCT CATCTGCAAG TTCAGAAAAA
ATAGTCGTTA TCCAGAATGG AGTTCCAATA CCAGAGGAAA CAGAACATTT AGTCAAACGA
CAAGCATCTC GTTTACAACT AAAACTTCAT CCAGAAGAAT TATTAATATT AGCAGTGGGA
AGAATGGAAG CACAAAAACA ACCACTGCTA TTTTTACAAT GGGCTAGTCA CCTCAAAAGC
AATCTACCAT CTGCTCGTTT CTTATGGGTG GGAGATGGTC GCTTAACTTC TTTATGGGAT
CAATGGGTGA TAGAAAATCA TGCCCAAGAC TATATCCAAC GCCTAAGCTG GCAAAATGAC
GTAACACCAT ATTTAGCAGC CGCAGACGGA TTTTTTCATC CTGCGGCCTT TGAAGGTTTA
CCATTTGCAC TATTAGAAGC AATGGCTTGG TCTTTACCCT GTGTAATTAC CTCAACTCTG
GCGGATGAGT TAAAGTTTCC TCAAGGTGTT TATTTCGTAG CTTCTGAACA AGACCAATTT
AAAGATCTAA AGAATTTTAT TAATTCTCAG GAGCGTAATG CAGTAGCAAA TGTTGGCTAT
CAAATAATCA AAGAGCAATT TTCCCTGGAA AAAATGGTAG ATACCTACGT ATCCTTATAT
ATAGCAATTC TAAATTATTA A
 
Protein sequence
MRILFVSSSS GSRGGGELYL IYLGQELANR GYAVGLWCSQ HPIMNELASS FGRFGEVLRS 
PYQNTYSLKL RSFTHIFPQI NQKIISQWQA FKPDILHFNK QNLEDGLDLL SWSHYLSIPY
LVTIHITQTQ DSLGAFLGKW RDLIAKISLR KYRGSLVAIS EHRGKELTSF LATSSASSEK
IVVIQNGVPI PEETEHLVKR QASRLQLKLH PEELLILAVG RMEAQKQPLL FLQWASHLKS
NLPSARFLWV GDGRLTSLWD QWVIENHAQD YIQRLSWQND VTPYLAAADG FFHPAAFEGL
PFALLEAMAW SLPCVITSTL ADELKFPQGV YFVASEQDQF KDLKNFINSQ ERNAVANVGY
QIIKEQFSLE KMVDTYVSLY IAILNY