Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4730 |
Symbol | |
ID | 9342537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4834170 |
End bp | 4836218 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | chaperone protein DnaK |
Protein accession | YP_003723047 |
Protein GI | 298492870 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.16532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAAGG TAGTCGGCAT CGACTTGGGT ACAACCAACT CAGTAGTCGC CGTAATGGAG GGTGGCAAGC CGGTGGTGAT TGCCAATGCT GAAGGTATGC GAACAACTCC CTCTGTTGTT GGCTTCACCA AAGAAGGTGA AAGGGTGGTT GGACAAATGG CAAGACGGCA AACTGTTCTT AATCCCCAAA ATACCTTTTT TGCAGTTAAA CGCTTCATTG GGCGGAAATA TGGTGAACTT AACCCAGATT CCAAGCGTGT ACCCTATACC ATCCGCAAAG ATGAAATAGG CAGTATTAAA GTTGCGTGTC CCCGCTTAAA TAAAGAATTT ACCCCAGAAG AAATTTCAGC TATGGTGCTG AGGAAATTGG CAGATGATGC TGGTCGCTAT TTGGGAGAAA CTGTCACTGG GGCTGTAATT ACCGTTCCAG CTTATTTTAA TGATTCCCAA CGCCAGGCTA CCCGCGATGC TGGGAGAATT GCTGGTTTAG ATGTTTTGCG AATTCTCAAT GAACCGACTG CGGCTTCTTT GGCTTACGGA TTATATCGGG GTGAAACGGA AACCATCTTA GTTTTTGATT TGGGTGGTGG GACTTTTGAT GTGTCGATTT TGGAAGTAGG TGACGGCATA TTCGAGGTTA AAGCCACTAG TGGAGATACG CAATTAGGTG GTAATGATTT TGATAGAAAG ATAGTTGATT GGTTGGCAGG ACAATTTTTA GAAGCAGAGG GTGTAGATTT AAGAAGTGAT CGCCAAGCTT TACAAAGGTT AATGGAAGCC GCAGAAAAGG CCAAAATTGA ACTTTCTGCT GTCAGTGTCA CCGATATTAA CCTACCCTTC ATCACCGCCA CAGAGGACGG ACCTAAACAT TTAGAAACAC GCCTGACGCG ATCGCAATTT GAAGGACTAT GTGGTGACTT AATCAGCAGA GTGCGAACAC CAGTCAAAAG GGCGCTAAAA GATGCCGGAC TTTCCCCAGT AGATATTGAA GAAGTTGTAC TAGTAGGCGG TTCCACCAGG ATACCAATAG TAAAACAGCT AGTGCGGGAC TTCATAGGTA TGGAACCCAA CGAAAACGTC AACCCTGATG AAGTTGTGGC CGTAGGTGCA GCAATTCAAG CAGGTATTTT AGCCGGCGAA CTCAAAGATG TATTGCTGTT AGATGTCACA CCCCTATCTT TAGGACTGGA AACCATTGGT GGCGTGATGA AAAAGCTCAT TCCCCGTAAC ACCACAATAC CAGTACGACG CTCCGACATT TTTTCCACCT CTGAAAATAA CCAAAACAGC GTAGAAATCC ACGTTGTCCA AGGTGAGAGA GAAATGGCAG CAAATAATAA GTCTTTGGGA AGATTTAAGC TGTATGGCAT CCCACCAGCA CCACGAGGCA TCCCACAGGT TCAAATATCC TTTGATATTG ATGCCAACGG GATTTTACAG GTAACGGCTT TAGATCGTAC CACTGGCAGA GAACAGAGTA TCACAATTCA AGGCGCTTCT ACCTTGAGTG AATCAGAAGT AAATCGCATG ATTCAAGATG CTCAAAAATA TGCTGATGTT GACCGGGAAA GGAAAGAACG GGTAGAAAAA CGCACTCGTT CCGAGGCATT GATTTTACAA GCAGAACGGC AATTGAGGGA AGTAGCCTTG GAAATGGGGA TGCAGTTTGC CCGTAACCGT CGTCAACGCA TTGACAATAT TTGCCGGGAA CTGCGTGAAA GTTTAAAAGA TAATGATGAT CGCGGTATTG ACCAAGCTTA CTCTGACCTG CAAGATGCTC TGTATGAGCT AAATCGCGAA GTGCGAGAGT ATTATGCTGA AGATGAAGAT GAAGACCTAT TTGGTGCCAT CCGTGACATC TTCACTGGTG ATAAAGAACG GGAACGGGAT TATTCCAGAG AAAACTATCG GGAACCCGAT TCTAATAACA GAGACTATAG TCGAGACTAT GGTAGAGACA ATCGTTCTCC CTCCTATGAT AGTCCTCCAC CACACCGCCG CCGTCCCACC TACCGGGATA ATTGGGATGA AGACGATGAT TGGCTGTAA
|
Protein sequence | MGKVVGIDLG TTNSVVAVME GGKPVVIANA EGMRTTPSVV GFTKEGERVV GQMARRQTVL NPQNTFFAVK RFIGRKYGEL NPDSKRVPYT IRKDEIGSIK VACPRLNKEF TPEEISAMVL RKLADDAGRY LGETVTGAVI TVPAYFNDSQ RQATRDAGRI AGLDVLRILN EPTAASLAYG LYRGETETIL VFDLGGGTFD VSILEVGDGI FEVKATSGDT QLGGNDFDRK IVDWLAGQFL EAEGVDLRSD RQALQRLMEA AEKAKIELSA VSVTDINLPF ITATEDGPKH LETRLTRSQF EGLCGDLISR VRTPVKRALK DAGLSPVDIE EVVLVGGSTR IPIVKQLVRD FIGMEPNENV NPDEVVAVGA AIQAGILAGE LKDVLLLDVT PLSLGLETIG GVMKKLIPRN TTIPVRRSDI FSTSENNQNS VEIHVVQGER EMAANNKSLG RFKLYGIPPA PRGIPQVQIS FDIDANGILQ VTALDRTTGR EQSITIQGAS TLSESEVNRM IQDAQKYADV DRERKERVEK RTRSEALILQ AERQLREVAL EMGMQFARNR RQRIDNICRE LRESLKDNDD RGIDQAYSDL QDALYELNRE VREYYAEDED EDLFGAIRDI FTGDKERERD YSRENYREPD SNNRDYSRDY GRDNRSPSYD SPPPHRRRPT YRDNWDEDDD WL
|
| |