Gene Aazo_5201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5201 
Symbol 
ID9343008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5324089 
End bp5325381 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content29% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003723362 
Protein GI298493185 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.131899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTAC AGGGAATTAA TGTATTAGTA GATGGATATA ACCTAGAAAT GATCCAAGGG 
ACAGGAATAA AAACCTACGG TTTCACCTTA GTTAAGGCTC TGATTGCTCT AGAAGCAAAT
GTAGATTTAT TATGTAGTCG TTATACTAAT AGTTATAATA GTGATTTACT TTTAAATGAA
GCTTTATTTT TCGACATACA AAAATCTAAT AGTAGCAATT TAGAAATTAA GAGTATTATT
AGTGCTGCTA TTCAAGGATT TTATCAAGCC AAAGAAGTGC AAGTTAGTGA TTTTGTCATC
AAACGAGATG CCGACTATAT TTTTGAGTAT TTAGCTAGTT CAGGAAAAAT ATTCAATATT
TCTAATTGTT ATAGAACAGC CAATAATTTA TATAAACATT TTCACTTACA AACCAGAGTA
AACATTAAAA AGAAAATTGA TATCTGGCAT GTTACTTATC CAATTCCTAT TAAAGTAAAT
GCTGCCAGAA AAATTACAAC TATTCATGAT TTAATTCCGT TAAAACTTCC TTATACTACT
TTAGATGATA AAAAATGTTT TTTTAATTTA ATCAAGGATG CAATTAAAAA TTCCGAGATT
ATTCTAACGG TTTCAGAAAG TACAAAAAAT GATATCTTAC ATTGTTTTGA TGTTAATCCA
GATAAAATTT ATGTGACATA TCAACCAATA ATTGATAATT CACATTTGGT TGAAAACCAT
ACAACAGAAA CTAAGTTAAA AAAGTATAAA CTTAAAAATA AACAATATAT TCTATTTGTA
GGAACTATAG AACCTAAAAA AAATATAGGC CGATTAATAG ATGCATATAG TGGTTTAGAT
ACTGATATGC AGCTAGTTAT TGTTGGCAAA AAAGGATGGT TATGGGAAGA TGAAATCGGT
AAATTAGAAG CAGTATTTGG TAAAGATTTT AGCAGGGAAA TTAAGTTATT GGAATATGTA
GAGAAAAAAG ATTTATTATA TCTCTATAAT GGTGCTTTTT GTTTTGTTTT TCCATCTTTG
TACGAAGGAT TTGGTTTACC ACCTCTAGAG GCTATGTCTT TGGGATGTCC TGTTGTAACC
TCTAATGTAG CTTCTTTACC AGAAGTTTGT GGAAATGCTG CTCTTTATGT AGATCCTTTC
GATTCAGATG AAATTAGACT GGGAATTGAG AAGTTGATAA ATAATCCTCA AATACAAAAC
CAACTTATAG AAGCTGGCAA AGAAAGAGTA AAACTATTTA GTATGGAAAA TTATGCAAAT
AAACTTTATG AAGCTTATAC AAAAGTAATC TAA
 
Protein sequence
MDLQGINVLV DGYNLEMIQG TGIKTYGFTL VKALIALEAN VDLLCSRYTN SYNSDLLLNE 
ALFFDIQKSN SSNLEIKSII SAAIQGFYQA KEVQVSDFVI KRDADYIFEY LASSGKIFNI
SNCYRTANNL YKHFHLQTRV NIKKKIDIWH VTYPIPIKVN AARKITTIHD LIPLKLPYTT
LDDKKCFFNL IKDAIKNSEI ILTVSESTKN DILHCFDVNP DKIYVTYQPI IDNSHLVENH
TTETKLKKYK LKNKQYILFV GTIEPKKNIG RLIDAYSGLD TDMQLVIVGK KGWLWEDEIG
KLEAVFGKDF SREIKLLEYV EKKDLLYLYN GAFCFVFPSL YEGFGLPPLE AMSLGCPVVT
SNVASLPEVC GNAALYVDPF DSDEIRLGIE KLINNPQIQN QLIEAGKERV KLFSMENYAN
KLYEAYTKVI