Gene Aazo_3845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3845 
Symbol 
ID9341648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3896723 
End bp3897958 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content37% 
IMG OID 
Productgroup 1 glycosyl transferase 
Protein accessionYP_003722483 
Protein GI298492306 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000452091 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATA TTTCACAAAT AGGGACACAT ATTAGGGAGA AAACTGCTTA TCCAGATATC 
CTTGTTATCT CCCGCATATT TCAGCCACAA GAAGCTGTCA TTGGAGAATA TATATATAAT
CGCTGTTTAC AAGACCCAGA AAGAGTAATC GTCCTAACCG CTAGTTGTTT AGGAGATAGA
ATATTTGATA AATCTCAAAA TTTTCCTGTT TATCGTTGGC CTAACTTTAC TTTCTGGACT
AGTACATTAT TGACTAAATT AGTGAAGCCC ATATTCAATA TTATTGGCTC CGTATTACTA
GCCATAAAGC TTTATTTCCG TTATCATTAC CGCTACATTG AATGGTGTCA CGGTTACGAT
TTCCCCGCCT TACTTATACT AAGTTATATC TTACCTATTC GCTTTTTTAT CTACCTCCAC
GGTAATGATT TAGTTAGTAA TTTACGTAAT CCATTGTGGC GATCACTATT TAAACTTACC
CTCAAAAGAG CCGAAGGAAT TGTTTGCAAC AGTTCCTATA TTCGAGATAT TTTAAGAAAA
AACTTTCGGC TAGATACTCC TACTCATGTA ATTAACCCAG TAGTAAGACC AGAAAAATTT
GGTACTCCTA CCAGTCCCAG TCATCTCGAT GATTTACGTA TCCGGTTACG TCAAGCTTAT
AATATTCCTG AAACAGCTAT TCTGATTCTT TCTGTTGGTA GATTAGTTCA ACACAAAAGC
TTTGACCGCA TCATAGATAA CATTCCTTTA CTATTAACTA TAGGCATAGA TGTCCATTAC
ATAATTTGTG GCACCGGACC TTGTGAACAA CAGCTAAAAT CCCAAGCCCA ACGCTTGCGG
GTAGACAAAC GAGTACACTT TGCAGGCCAT GTACCAGAAC GAGAATTAGC TAGTTATTAT
GTAGCCTGTG ACATTTTCTC CATGCTAACT TTGTGGGAAG ACAAAGATAA AAGTATAGAT
AACTTTGGCA TGGTTTACTT AGAGGCAGAA TACTTTGGTA AGCCCATAAT TGCCTCTCGT
TTAGGGAGTA TTTTAGATGC AGTTCACCAT GAAGAAAATG GCCTGTTGGT AAATCCCAAT
TCTGGCTATG CAGTTTTGCA AGCTTTTAAA CGCTTATGTC AAGACAAACA ACTACGAGAA
AAACTCGGTC GTCAAGGACA AGAATTTGCC AAACGGAAAA CATATCACCG TTGGCTATAT
AATCCAGAGT CTCGTTATTC TTGTTTATTG AATTAG
 
Protein sequence
MEHISQIGTH IREKTAYPDI LVISRIFQPQ EAVIGEYIYN RCLQDPERVI VLTASCLGDR 
IFDKSQNFPV YRWPNFTFWT STLLTKLVKP IFNIIGSVLL AIKLYFRYHY RYIEWCHGYD
FPALLILSYI LPIRFFIYLH GNDLVSNLRN PLWRSLFKLT LKRAEGIVCN SSYIRDILRK
NFRLDTPTHV INPVVRPEKF GTPTSPSHLD DLRIRLRQAY NIPETAILIL SVGRLVQHKS
FDRIIDNIPL LLTIGIDVHY IICGTGPCEQ QLKSQAQRLR VDKRVHFAGH VPERELASYY
VACDIFSMLT LWEDKDKSID NFGMVYLEAE YFGKPIIASR LGSILDAVHH EENGLLVNPN
SGYAVLQAFK RLCQDKQLRE KLGRQGQEFA KRKTYHRWLY NPESRYSCLL N