Gene Aazo_5157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5157 
Symbol 
ID9342965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5283315 
End bp5284514 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content36% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003723338 
Protein GI298493161 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAT CAAAACAGCA TATTTGTCAT ATAGTTTTGA GTAATATTGG TGGAGCGCCA 
CGGACTGCGG ATTCCCTAAT TTCTTCCCAA TCTAAAGCTG GGTACAAAGT CTCTAGCGTA
GTATTGACTA ATTTAGATCC TCAATGGATA GTAGCCTTTC AAGCTGCTGA AAAACTAATA
ATCATGAAAG TTGCAGGTAG TTTATTTTAT ATCGGTGGAC CGATACACCA ACTATGGATA
GCTATCCAAT TAAGGAAAGT AATTTCTGAC CTAAAACCAG ATATTGTTGT TTGCCATACG
GCATTTATCA CTAAACTTTT TTATATCTCT CAACTTATTC CTGGCAGTTT TTCAGTTCCC
TCTATCAGTT ATATTCATAC TGATGTTATT TCTGAACTAC CTGCTGAAAG CAAAAGCAGG
TTATACTCAG TGATAAAACT ATTACAGAAT TTATTTATAC TTATTGATAA TTGGATTAGT
GTACGTAGTC TTCAGCAAGC TAGTGGATTA GTATTTGTTT GTAAAAGTCT ATATGAAAGA
TTCTTAGATC TGGGCTTAAG TCCTCGTCGC ATAGCAATAT GTTATAATCC AGCAATACCT
GATCCAAGTC ATAAGCCGTT AAATGCTACA GCGGAATCGT GGTTCAAAAG CCCTGATTTA
ATTACCTTTG TATCTGCTTC CAGATTTCAT CATCAAAAAG ATCATCAAAC ATTACTCAAA
GCATTTGCTC AAGCTAGTCA ATATCACTCC AACATCCGGT TAATTTTACT AGGAGATGGT
GGTTTAGAAA CACAAATCCA AAAATTAGCG ACTTCTTTAG GAATTAGTAA TCTTGTTTTA
TTTGCAGGTA CTGTTACTAA TCCTAGAGCT TACTTCTCAT TATCTAGAGC AGTTATACTT
GGTTCTCATT ATGAAGGATT TGGTATGGTG CTTGTAGAAG CTGTAGCTAG TGGGGTAACG
TTTATTTCCT CTGATTGTCC TGTTGGTCCC CGTGAGATTT CTGAAGTGCT GCAATGTGGA
ACTTTAGTAC CAACAAATGA TGTTGATGCT TTAGCACAAG CAATTATTAC TCATGTAGAA
ACACCTAAAG AAATAATAGA CCGTTCTGAG CAAATAGAAA GACTTTTTAG TGAGTCTACC
TGTGCCAATA GCTTAGAAAT TTTGCTCCAG GAGGTGTTTG GTGAAAAACT GTATAAATGA
 
Protein sequence
MIKSKQHICH IVLSNIGGAP RTADSLISSQ SKAGYKVSSV VLTNLDPQWI VAFQAAEKLI 
IMKVAGSLFY IGGPIHQLWI AIQLRKVISD LKPDIVVCHT AFITKLFYIS QLIPGSFSVP
SISYIHTDVI SELPAESKSR LYSVIKLLQN LFILIDNWIS VRSLQQASGL VFVCKSLYER
FLDLGLSPRR IAICYNPAIP DPSHKPLNAT AESWFKSPDL ITFVSASRFH HQKDHQTLLK
AFAQASQYHS NIRLILLGDG GLETQIQKLA TSLGISNLVL FAGTVTNPRA YFSLSRAVIL
GSHYEGFGMV LVEAVASGVT FISSDCPVGP REISEVLQCG TLVPTNDVDA LAQAIITHVE
TPKEIIDRSE QIERLFSEST CANSLEILLQ EVFGEKLYK