Gene Aazo_3925 details

Gene Information

Locus tag: Aazo_3925
Symbol:
ID: 9341729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3987887
End bp: 3989152
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: group 1 glycosyl transferase
Protein accession: YP_003722551
Protein GI: 298492374
COG category:
COG ID:
TIGRFAM ID:
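
The coordinates and lengths in this record can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3987887, 3989152)
print(length, protein_length(length))  # 1266 421
```

Both results agree with the Gene Length and Protein Length fields listed for Aazo_3925.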


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC AACCTCTTCG CATCGCCTTA TTTACAGGAT TGTTTCCTCC ATTCTTAACA 
GGAGTTTCAG TCGCAGTACA TCAGCGAGTA CGTTGGTTAC TTGAACAAGG ACATCAAGTT
TTTCTCATCC ACCCAGAAAT AAACAATCAG TATCCTAAAA TAGTTAGTAA TCGTCCCATG
CCGGGACTAG AAGAACTACA ATCTTTCCCT GGATTTTCAT CTTACGCCTT TCCTACACAA
CCACTAATCT TCTACAAATC GCTACCTCAA CCACTCAACT ATCGCCATTG GAGCGATACT
AAATTACTAG AGAAATTTCA GCCTGATATT ATCATTGTTG AAGAAGCAGC ACAGATGAGG
GGGTTATACT CAATTTTCTT GCAAGGCTAT GGTCGGCCGA TAGGAGTTGA ATACGCTAAA
CGTACCAAAA CCCCAATTAT CTCCGTGTTT CATACTGATA TCGTGGCTTA TATCCGATAT
TATTTAGGAG ATGTATTCTT CAGTTTACTA CGCCCAATCG TTCCTCTTTT AGTGAAGCAG
TTTAGTAATG CGTATAGTCT CAATTTATTT CCATCTAGAG AACAACTATC TAAATACCAA
AAGCTCAAAT GTAAACGGGT TGAATACGTT CCTTATCAAG GAATTAATTG TGAAAAATTT
CACCCCCGGA ACATCTGTTA TGACCCAAGA CCTAATGATC AACGCCCAAC TATTCTCTTC
GTTGGACGCA TCACAGCAGA GAAAAATGTC ACTCAACTTT TAGATGCATT TCCATTCATT
GCTGCTAAAA TTCCCGATGT CCACTTAGTT ATTATTGGCA GCGGACCTTT AGATCAAGAA
ATTCGTCGCC GCGCTCAAGC TTTCCCATTT GGAGTAACAA TTTGGGGTGA ATCTCACGGT
ACCGAACTTT TGGGATGGTT CGCTAGAGCC GATGTTTTTG TTAACCCTTC AGTCACGGAA
AACTTCTGCA CTACAAATAA CGAAGCTTTA GCTTCTGGAA CTCCTGTAGT TGCAGCTATC
GCTCCTTCAA CTCCTGAACA AGTGATCATT GGTTATAATG GCTTTCTTGC TCAACCCAAC
AACCCCAAAG ATTTTGCTGA GAAAATAATT AAAATTCTCG AAAATTCTGA CCTCAAAGCA
CAATTATCTA AGCAATCTCG TCCTTCAATA TTAGAATTTG ATTGGTCAGT ATGTAGCGAA
AAATTTGAAG ATAAGCTCTA CCAATTAGTT GGAATACCCA AAATAGTTGA GTTATCTAAT
AATTAG
 
Protein sequence
MKKQPLRIAL FTGLFPPFLT GVSVAVHQRV RWLLEQGHQV FLIHPEINNQ YPKIVSNRPM 
PGLEELQSFP GFSSYAFPTQ PLIFYKSLPQ PLNYRHWSDT KLLEKFQPDI IIVEEAAQMR
GLYSIFLQGY GRPIGVEYAK RTKTPIISVF HTDIVAYIRY YLGDVFFSLL RPIVPLLVKQ
FSNAYSLNLF PSREQLSKYQ KLKCKRVEYV PYQGINCEKF HPRNICYDPR PNDQRPTILF
VGRITAEKNV TQLLDAFPFI AAKIPDVHLV IIGSGPLDQE IRRRAQAFPF GVTIWGESHG
TELLGWFARA DVFVNPSVTE NFCTTNNEAL ASGTPVVAAI APSTPEQVII GYNGFLAQPN
NPKDFAEKII KILENSDLKA QLSKQSRPSI LEFDWSVCSE KFEDKLYQLV GIPKIVELSN
N
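
To check the protein sequence against the gene sequence, the nucleotides can be translated with the codon assignments of translation table 11. A minimal sketch in plain Python follows. The helper names (clean, translate, gc_percent) are hypothetical. Whitespace from the 10-base display blocks is stripped before processing, and the example input is only the first 21 bases of the gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11;
# its codon assignments are the same as those of the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence, as displayed in the record.
print(translate("ATGAAAAAAC AACCTCTTCG C"))  # MKKQPLR
```

The output matches the first seven residues of the protein sequence above. Passing the full gene sequence to translate and gc_percent allows the same comparison against the complete protein and the GC content field.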