Gene Aazo_4068 details


Gene Information

Locus tag: Aazo_4068
Symbol:
ID: 9341873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4129464
End bp: 4130969
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 33%
IMG OID:
Product: group 1 glycosyl transferase
Protein accession: YP_003722647
Protein GI: 298492470
COG category:
COG ID:
TIGRFAM ID:
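
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4129464
END_BP = 4130969
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With these assumptions, 4130969 - 4129464 + 1 gives 1506 bp, and 1506 / 3 - 1 gives 501 aa.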


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.736051
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAATTG GTAAACCTAA TTTAAAAATC ATTGTAGATG GAGTGTTTTT CCAACTTCAT 
ATTTTGGGGG TAGCTAGAGT ATGGATGGAA TTACTCAAGG CATGGATTAA AAGCGGATAT
GCTGAAAACA TAATTGTTTT AGATCGTCAA GGAAGTGCAT TTAATTTTCC TCCAGATGCT
GTTTTTCATC CATTAGGGAC AATGCCAAGA TTACCAGGAT TAAAGTATAG GTTAATTCCT
GGTTATAGTT ATGAAAATGC TCAAACTGAT ATGCAAATGT TACAAGAAAT TTGTGATGAG
GAAAATGCTG ATTTATTTGT TTCTACCTAC TATACAAGAC CTATTAATAC TCCCAGTGTT
TTAATGCTTC ATGATATGAT TCCTGAAATT GAGGGTTTGG ATGAACCTCA ATGGAAGCAG
AAACATGAAT GTATTAGATC TGCCTCTGCA TATATAGCAG TTTCTCAAAA TACAGCTAAA
GATTTTTCCC ACTTCTTTCC AGAAATTGAT CATGTTTTAG TTAAGGTCAT TTATAATGGT
GTAGATCATC AAGTATTTCG TCCAGCTAGT TTAACCGAAA TTAATCAATT TAAACAATCT
TATGGTATTA CTAAACCCTA TTTTCTTTTA GTGGGAGTAA GAACTGGTTA TAAGAATGCT
CTCTTATTTT TTAAGTCGTT TGCTCAGTTA CCAAATCAAG AAGATTTCTC TATTGTTTGT
GTTGGTGGTG GTTGGGGAAT AGAAGAACAA TTTAAAGAAT ATATTACTCA AACACAAATT
TTAAAATTGC AACTTACTGA TCAGGAATTA AGTATGGCTT ATTCAGGTGC GATCACACTT
GTTTACCCGT CTTTGTATGA AGGATTCGGT CTAGCTGTGT TAGAAGCTAT AGCTTGTGGT
TGTCCAGTAA TTACTTACCC CAGTTCTGCT ATTCCTGAGG TGCTTGGTAA AGCCGCGCTT
TATATTGATG ATGATATTGA AATCATGAAA AGAGCTTTAA TAACTATTCA ACATGAACAA
ATAAGACAAA CTCTCATTCA AGCAGGATTA GCACAAGCTG AGAAGTTTTC CTGGTCAAAA
ATGGCTGAAG AAGTGAGTAA TGTTTTCATT GATGAAACTC TAAAGTTTTT AAACTTGCGA
GAAATTAATC TAATCATATT CCCTGATTGG AGTCAATCAG AAGGTGATTT ATATATTCAA
CTAGTTGAAG TAATTAAAAA ACGAGTGAGC GATATTAATT CTTATAAAAC TACTTTACTG
ATATATGTTC TTGATGATAC TGAAGGGGAA ACTGCTGATT TACTGTTATC CAGTATAGCA
GTTAATTTAA TGATGGAAGA TGAGATTGAT ATTACCGAAA ATCTGGAAAT TTCACTAATG
CTAGACATCA ATGAAAAACA TTGGAAAAGT CTCTTACCAC ATTTGCATGG TAGAATTATA
TTAGATGCGG AAAATCAAGA AGTTATAGTC AAATTTTCAG CAGAAAAACT ACTAGTTTGG
AAATAG
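
The GC content field in the record can be recomputed from a gene sequence such as the one above. The following is a minimal sketch, assuming GC content is defined as the share of G and C among all A, C, G and T bases; how the database rounds the figure is not stated, and `gc_content` is an illustrative helper name.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string.

    Whitespace (such as the 10-base column spacing used in the
    record) is ignored; characters other than A, C, G, T are skipped.
    """
    seq = "".join(dna.split()).upper()
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Small illustrative input, not taken from the record.
print(gc_content("ATGC"))
```

Passing the full gene sequence, pasted as a single string, should give a value comparable to the GC content listed in the record.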
 
Protein sequence
MTIGKPNLKI IVDGVFFQLH ILGVARVWME LLKAWIKSGY AENIIVLDRQ GSAFNFPPDA 
VFHPLGTMPR LPGLKYRLIP GYSYENAQTD MQMLQEICDE ENADLFVSTY YTRPINTPSV
LMLHDMIPEI EGLDEPQWKQ KHECIRSASA YIAVSQNTAK DFSHFFPEID HVLVKVIYNG
VDHQVFRPAS LTEINQFKQS YGITKPYFLL VGVRTGYKNA LLFFKSFAQL PNQEDFSIVC
VGGGWGIEEQ FKEYITQTQI LKLQLTDQEL SMAYSGAITL VYPSLYEGFG LAVLEAIACG
CPVITYPSSA IPEVLGKAAL YIDDDIEIMK RALITIQHEQ IRQTLIQAGL AQAEKFSWSK
MAEEVSNVFI DETLKFLNLR EINLIIFPDW SQSEGDLYIQ LVEVIKKRVS DINSYKTTLL
IYVLDDTEGE TADLLLSSIA VNLMMEDEID ITENLEISLM LDINEKHWKS LLPHLHGRII
LDAENQEVIV KFSAEKLLVW K
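
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard NCBI base ordering (TCAG). NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs in its permitted start codons, which this sketch does not handle; that is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Whitespace is ignored. A codon containing anything other than
    A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate("ATGACAATTG GTAAACCTAA TTTAAAAATC"))
```

The first 30 bases translate to MTIGKPNLKI, which matches the first ten residues of the protein sequence listed above.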