Gene Aazo_4553 details

Gene Information

Locus tag: Aazo_4553
Symbol:
ID: 9342358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4643823
End bp: 4645094
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: group 1 glycosyl transferase
Protein accession: YP_003722939
Protein GI: 298492762
COG category:
COG ID:
TIGRFAM ID:
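
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_len = gene_length_bp(4643823, 4645094)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1272 423
```

Under these assumptions the results match the Gene Length and Protein Length fields.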


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTCTA CCACTAACAA ACGCATTGCC TTGATTTCAG TCCACGGTGA TCCGGCGATT 
GAAATTGGGA AAGAGGAGGC TGGAGGACAA AATGTTTATG TTCGCCAAGT GGGTGAAGCA
CTATCCCAGC TAGGATGGCA AGTTGATATG TTTAGCCGCA AAGTGAGTGT TGACCAAGAA
GATATCGTTC AACATAATTC TCGTTGTCGA ACCATTCGTT TAACAGCCGG ACCAGTTGAA
TTTGTACCAC GAGATAACGG TTTTAAATAC TTGCCAGAAT TTGTGGAGCA GTTATTGGAA
TTTCAAAAAC AAAACAGCAT TAAATATGAG TTGGTTCATA CTAACTACTG GCTATCTAGT
TGGGTTGGGT TGCAGCTGAA ACAAATCCAA GGAAGTAAAC AGGTTCACAC ATATCATTCT
TTAGGAATAG TCAAATACAA CACAATAGAA AATATTCCTC TAGTTGCTAG TCAACGTCTA
GCAGTGGAAA AAGAAGTATT GGAAACAGCG GAAAGAATTG TGGCGACAAG TCCGCAAGAA
AAACAACACA TGAGAACTCT GGTTTCTCAT CAAGGAAACA TTGATATTAT TCCTTGTGGT
ACAGATATTC GCCATTTTGG TTCAGTGGAT AGACAAGCAG CTAGAGAAGC ATTGGGAATT
GATCCACAAG CCAAAGTTGT TTTGTATGTA GGGCGTTTTG ACCCACGCAA AGGGATAGAA
ACCTTAGTGC GTGCTGTGCG TGAGTCTAAG TTTTATGGTG ATAAAGACTT AAAACTGATT
ATTGGTGGTG GAAGTACACC AGGTAACAGT GATGGTAGAG AACGTGATCG CATCGAGGGA
ATTGTTAACG AATTGGGAAT GAGTAAATTT ATTTCCCTTC CTGGTCGTCT CAGTCGAGAA
GTCTTACCAA CTTATTACGG TGCGTCTGAT GTTTGTGTGG TTCCCAGTCA CTATGAACCC
TTTGGACTCG TGGCTGTGGA AGCAATGGCC AGTGGAACAC CAGTTATAGC TAGTGATGTT
GGTGGTCTTC AGTTTACCGT TGTTAATGAA AACACTGGCT TATTAGTACC ACCCCAAGAC
GTAGCAGCCT TTAGTAACGC CATTGACCGC ATTCTTGGTA ATCCCCAATG GCGTGCACAA
CTAGGTCAAT CGGGTAATAG ACGGGTAATG AGTAAGTTTA GCTGGGACGG TGTAGCTAGT
CAGTTAGATG CCCTATACAC CCAACTACTG CAACCAGTTA AAGAAAAAGA ACCTGCTTTA
GTTAGTAAGT GA
 
Protein sequence
MNSTTNKRIA LISVHGDPAI EIGKEEAGGQ NVYVRQVGEA LSQLGWQVDM FSRKVSVDQE 
DIVQHNSRCR TIRLTAGPVE FVPRDNGFKY LPEFVEQLLE FQKQNSIKYE LVHTNYWLSS
WVGLQLKQIQ GSKQVHTYHS LGIVKYNTIE NIPLVASQRL AVEKEVLETA ERIVATSPQE
KQHMRTLVSH QGNIDIIPCG TDIRHFGSVD RQAAREALGI DPQAKVVLYV GRFDPRKGIE
TLVRAVRESK FYGDKDLKLI IGGGSTPGNS DGRERDRIEG IVNELGMSKF ISLPGRLSRE
VLPTYYGASD VCVVPSHYEP FGLVAVEAMA SGTPVIASDV GGLQFTVVNE NTGLLVPPQD
VAAFSNAIDR ILGNPQWRAQ LGQSGNRRVM SKFSWDGVAS QLDALYTQLL QPVKEKEPAL
VSK
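
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, not the method used by the source database. It assumes the NCBI codon ordering (bases in the order T, C, A, G) and relies on table 11 sharing its amino-acid assignments with the standard code. Handling of alternative start codons is not implemented, and the names `clean` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino-acid assignments in NCBI order (codons enumerated TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; it differs in
# which codons are permitted as starts, which this sketch does not handle.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides and last 12 nucleotides of the gene sequence above.
print(translate("ATGAACTCTA CCACTAACAA ACGCATTGCC"))  # MNSTTNKRIA
print(translate("GTTAGTAAGT GA"))                     # VSK
```

The two printed fragments match the first ten and last three residues of the protein sequence listed above.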