Gene Aazo_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2541 
Symbol 
ID9340340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2645403 
End bp2646551 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content40% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003721560 
Protein GI298491383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.433843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATAG CCTGGATTGG AAAAAAATCG CCATTTTGCG GCAATGTCAC CTACAGTCGA 
GAAATTACTA ATGCGTTGCT AGACCGGGGA CATCAAGTTA GCTTTCTTCA CTTTGCTCAA
GAAGAATCCC AAGCAGATAA CTGGCCTAAT TTTCAGGAAG TTTCTTTACC CTTCATTTAC
AAATCTCAGG TTTACACTAT TCCCACTTTT AAAGCGACTA AGGTTTTAAC TCAGTCACTA
AGGGAAATCA AGCCAGATAT TGTTCATGCT TCCTTGACTC TATCGACTCT GGATTTTGTT
TTACCAGAAA TTTGTGAAGA ATTAAATGTC CCTCTCATTG CCACTTTCCA CACTCCATTT
GCTGGTAAAG GGGCAAAATT AATTTCTGGT ACCCAGCTTT TAGCTTATCA ACTATACGCA
CCTTTTTTAG ATCACTATGA TCGGGTCATC GTTTTTTCCC AAATTCAAAG GGAATTATTG
GCACGCATGG GAGTTAGGGA AGAAAAAATT GCTGTTATTC CTAACGGTGT TGATACTGCT
AAGTATTCTC CTGGTAGTTC TCAAATAAAA GCCGAATTTG GTGCAGAGCG CTTATTTGTC
TATCAAGGTC GCATAGCCCC AGAGAAAAAC GTTGAATCCA TGCTACGCGC TTGGAAGCAG
TCAGATATGG CGACTGATAG TAAATTGTTA ATGGTTGGTG ATGGCCCGTT AAAATCTTCC
TTAGAAACTT TTTATGGTGC AGAATACGGT ATCCACTGGT TAGGATTTAT AGCAGATGAA
AACCGCCGCA TAGAAATATT ACGCGGTGCA GATGTATTTA TTTTACCTTC TTTGGTTGAA
GGTCTATCTT TATCTCTTTT AGAAGGAATG TCCTGTGGTG TAGCTTGTTT AGCCACTGAT
GTGGGTGCAG ATGGGGAAGT ATTGGAAAAA GGTGCAGGTG TAGTGATTAG TACCAGTTCT
GTGCGATCAC AACTCAGAAC ACTTTTACCA CTATTCCAAG ATCATCCAGA GTTAACAACC
CTGTTGGGGC AGAAAGCCAG AAAGCGAGTA TTAGAACGTT ATACCCTGAA TGATAATATC
ACGCAATTAG AAGAACTTTA TAACCGAGTT CTTGCACAGC GACCTTTAAC ACTAAGTTGG
GGTGTTTAA
 
Protein sequence
MRIAWIGKKS PFCGNVTYSR EITNALLDRG HQVSFLHFAQ EESQADNWPN FQEVSLPFIY 
KSQVYTIPTF KATKVLTQSL REIKPDIVHA SLTLSTLDFV LPEICEELNV PLIATFHTPF
AGKGAKLISG TQLLAYQLYA PFLDHYDRVI VFSQIQRELL ARMGVREEKI AVIPNGVDTA
KYSPGSSQIK AEFGAERLFV YQGRIAPEKN VESMLRAWKQ SDMATDSKLL MVGDGPLKSS
LETFYGAEYG IHWLGFIADE NRRIEILRGA DVFILPSLVE GLSLSLLEGM SCGVACLATD
VGADGEVLEK GAGVVISTSS VRSQLRTLLP LFQDHPELTT LLGQKARKRV LERYTLNDNI
TQLEELYNRV LAQRPLTLSW GV