Gene Aazo_5176 details

Gene Information

Locus tag: Aazo_5176
Symbol:
ID: 9342983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5300054
End bp: 5301358
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: family 2 glycosyl transferase
Protein accession: YP_003723349
Protein GI: 298493172
COG category:
COG ID:
TIGRFAM ID:
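
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminating stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5300054
END_BP = 5301358


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1305
print(protein_length(length_bp))  # 434
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed in the record.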


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.634801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTCA GTCTGTGCAT GATTGTCAAA AACGAGGAAA CCAACCTACC AAAATGCTTG 
CAAAGTGTCG AAGATGTGGT AGATGAAATT GTAGTCCTCG ATACAGGTTC AAGTGATCAA
ACAATCCAAA TCGCTGAACA ATTCGGCGCT AAGGTGCATT ATTTTGAATG GTGTAATAAT
TTTAGTACGG CTCGTAATGA AGCTTTAAAA TATGTTACAC GAGACTGGAT CTTAGTGTTA
GATGCTGATG AAAGTCTAAC ACCAGAAATA GCGCCCTATT TGCAAGAAGC AATTAATATC
CAAGATTATT TATTAATCAA TCTCGTCCGT CAGGAAATTG GTGCGACTCA ATCACCGTAT
TCTCTGGTTT CTCGACTATT TCGCAACCAT GCCAAGATTA AATTTGATCG TCCATATCAT
GCGTTGGTTG ATGATAGTAT TGCAGCAATT TCAACTAAAG AGACTTATTG GCAAATTGGC
TATTTACCAG AGGTAGCTAT TCTTCATGCT GGATATCAAA AAGCTATAAT TAGTCAGCAG
CACAAATATG GTAAAGCCGC AGCCGCAATG GAGGAATTTT TTGCTGCAAA TCCTGATGAT
GTTTATGTTT GCAGTAAGTT GGGTGCTTTG TATGTAGAAA TGGGGAAAAT TAATGAGGGA
ATGGAATTAT TAAATCAGGG ATTAAGTCAG ATGATTGGTA ATCAATTAAA CCAGTCAAAT
AATCAGGTTC ACAAAGATAA AATCCGTTTA AGGGGGTTTC AAAATTCTCA ATCAAGAAAA
GTTGGAAATT CTCTTAGTCA AGATATTAAG GAAACCAATT ATGATATTTT GTATGAATTA
CATTATCATT TAGGAATTGC TCATACACAT TTTAAAAATT TCAACCAGGC AATTTCCCAT
TATCAAGCTG CTGTAAAGTT ACCGATTTAT CCTCTTTTAA AGTTGGGAGG ATATAATAAT
TTAGGTAATT TGTTGAAGGT ATCAGCTGAT TTTCTAGGGG CAAAAAATGC TTATGAAACG
GCTATCAAAA TTGATCCTAG TTTTGTGACT GGTTATTATA ATTTGGGGAT GGTATGTAAA
GCTATGGGTT CGTTGGTTGA AGCCATTGAT TGTTATGACA AGGCTATTCA ATTAAATCCT
GATTATGCAG AAGCTTATCA AAATTTGGGA GTAGTGCTAC TGAAAGCCGG TGATGTCGAA
ACTAGTTTAG CAGCGTTTGA ATATGCGATC GCACTCCATG AAAAAAATAA TCCCCAGGAA
GCACAACGTC TCCGTCAAGG GTTGCAAGAC ATGGGATTGA AATAA
 
Protein sequence
MKLSLCMIVK NEETNLPKCL QSVEDVVDEI VVLDTGSSDQ TIQIAEQFGA KVHYFEWCNN 
FSTARNEALK YVTRDWILVL DADESLTPEI APYLQEAINI QDYLLINLVR QEIGATQSPY
SLVSRLFRNH AKIKFDRPYH ALVDDSIAAI STKETYWQIG YLPEVAILHA GYQKAIISQQ
HKYGKAAAAM EEFFAANPDD VYVCSKLGAL YVEMGKINEG MELLNQGLSQ MIGNQLNQSN
NQVHKDKIRL RGFQNSQSRK VGNSLSQDIK ETNYDILYEL HYHLGIAHTH FKNFNQAISH
YQAAVKLPIY PLLKLGGYNN LGNLLKVSAD FLGAKNAYET AIKIDPSFVT GYYNLGMVCK
AMGSLVEAID CYDKAIQLNP DYAEAYQNLG VVLLKAGDVE TSLAAFEYAI ALHEKNNPQE
AQRLRQGLQD MGLK
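
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence with the codon assignments of translation table 11. It is a hedged illustration, not the database's own procedure: the helper names are hypothetical, the first codon is assumed to be ATG (table 11 also allows alternative start codons, which this sketch does not handle), and translation is assumed to stop at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11;
# the codon assignments are the same as in the standard table).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons up to, but not including, the first stop."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three codons of the gene sequence listed above.
print(translate("ATGAAACTCA"[:9]))  # MKL
```

To check the whole record, paste the full Gene sequence block into `translate` and compare the result with the Protein sequence block after passing the latter through `clean_sequence`.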