Gene Aazo_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1980 
Symbol 
ID9339773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2059508 
End bp2060644 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content46% 
IMG OID 
Productchaperone protein DnaJ 
Protein accessionYP_003721179 
Protein GI298491002 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGCG ACTATTATGA AATTCTGGGT GTCTCTCGTG ACGCCGACAA AGAAGAAATT 
AAACAGGCTT ATCGCCGCCA AGCCCGGAAG TATCACCCAG ATGTGAACAA AGAACCGGGT
TCTGAAGAAC AATTTAAAGA AATCAATCGT GCTTATGAGG TTTTGTCTGA GCCAGAAACG
CGCGAGCGTT ATAACCGTTT TGGTGAAGCT GGTGTATCTG GTGCAGCAGC TGGCGCTGGC
TTCCAAGATA TGGGTGATAT GGGCGGTTTT GCTGATATCT TTGAAAGCAT TTTCAGTGGC
TTTGCTGGGG GAATGGGTAG TCCAACCCAG CAGCAAAGAC GACGCAGTGG ACCTGTGCGC
GGTGATGACT TACGGCTAGA CCTGAAGTTA GATTTTCGGG AAGCGGTATT TGGTGGTGAA
AAGGAAATTC GCATTTCTCA TTTAGAAACT TGTGAAGTGT GTAGTGGTTC TGGTGCTAAA
CCAGGTACTC GTCCCCGTAG TTGTGCCACT TGTAGTGGTT CTGGCCAAGT CCGGCGTGTG
ACTAGAACGC CGTTTGGTAG TTTTACTCAA GTTTCTACTT GTCCTACTTG TAATGGCACA
GGGACGGTAG TTGAGGATAA GTGTGATGCG TGTGATGGTA AAGGCGCAAA TCAGGTCACG
AAAAAGCTAA AAATTACTAT TCCGGCTGGG GTAGATAATG GTACACGCTT ACGAATCTCT
CAAGAAGGTG ATGCAGGTCA ACGTGGTGGA CCTGCTGGAG ATTTGTATGT TTATTTGTTT
GTGAATGAGG ATGAGGAATT CCAGCGAGAT GGAATTAATG TTCTCTCAGA AATCAAAATT
AGTTACCTGC AAGCGATTTT AGGTTGTCGT TTGGAGGTGA ATACTGTTGA TGGTCCTGTG
GAGTTGATTA TTCCGGCGGG AACTCAGCCA AATACGGTGA TGAAGTTGGA AAATCGTGGT
GTACCCCGTT TGGGAAATCC TGTTAGTCGG GGCGACCATA TGTTGACGGT GTTAATTGAT
ATTCCCAATA AGATCGCGCC GGAGGAGAGA CAACTGTTGG AGCAATTGGC TAAAATTAAG
GGAGACAGAA CTGGTAAAGG TGGTATAGAA GGATTTTTGG GAAGTTTATT TAAGTGA
 
Protein sequence
MARDYYEILG VSRDADKEEI KQAYRRQARK YHPDVNKEPG SEEQFKEINR AYEVLSEPET 
RERYNRFGEA GVSGAAAGAG FQDMGDMGGF ADIFESIFSG FAGGMGSPTQ QQRRRSGPVR
GDDLRLDLKL DFREAVFGGE KEIRISHLET CEVCSGSGAK PGTRPRSCAT CSGSGQVRRV
TRTPFGSFTQ VSTCPTCNGT GTVVEDKCDA CDGKGANQVT KKLKITIPAG VDNGTRLRIS
QEGDAGQRGG PAGDLYVYLF VNEDEEFQRD GINVLSEIKI SYLQAILGCR LEVNTVDGPV
ELIIPAGTQP NTVMKLENRG VPRLGNPVSR GDHMLTVLID IPNKIAPEER QLLEQLAKIK
GDRTGKGGIE GFLGSLFK