Gene Information |
Locus tag | Aazo_2048 |
Symbol | |
ID | 9339840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 2126491 |
End bp | 2128359 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | chaperone protein DnaK |
Protein accession | YP_003721226 |
Protein GI | 298491049 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
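The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed Gene Length) and that the final codon is a stop codon that is not counted in Protein Length. The function name `feature_length` is hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2126491
end_bp = 2128359


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_length = feature_length(start_bp, end_bp)

# A CDS should be a whole number of codons; the last one is assumed to be a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1869 0 622
```

Under these assumptions the computed values agree with the listed Gene Length (1869 bp) and Protein Length (622 aa).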
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00388714 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAG TAGTAGGAAT TGATTTAGGA ACTACAAACT CTTGTGTTGC CGTGATGGAA GGAGGGCAAC CTTTAGTAAT TGCTAATTCT GAGGGGCAAC GTATTACTCC TTCGGTTGTA GCTTACACAA AAACTGGTGA ACGCTTAGTT GGTCAAATTG CTAGACGACA AGCGGTAATG AACCCAGAAA ATACTTTTTA TTCTGTGAAA CGTTTCATTG GGCGCAAATA TGAAGAAATT ACCCATGAAG CCACGGAAGT TTCTTATAGA ACTCTGCGGG ATAGTAATGC CAATGTAAAG CTCAATTGTC CCGCTGCAGG TCAAGAATTT GCACCAGAAG AAATTTCTGC CCAAGTGCTA AGAAAGTTAG TTGATGATGC TAGTAAATAT TTAGGAGAAA AAGTTACTCA AGCGGTGATT ACAGTTCCTG CTTATTTTAA CGATTCTCAA CGGCAAGCTA CTAAAGATGC AGGTAAAATT GCTGGGTTAG ATGTTCTCCG CATTATCAAT GAACCAACAG CAGCAGCTTT GGCTTATGGT TTGGATAAAA AGGAAAATGA AACTATCTTA GTATTTGACC TTGGTGGCGG TACTTTTGAT GTTTCTATTT TGGAGGTTGG GGATGGTGTA TTTGAAGTTA AATCTACTAG TGGGGATACT CATTTAGGTG GTGATGACTT CGATAAAAAG ATTGTTGATT GGTTAGCAAA TCAGTTTCAG AGTAACGAAG GCATTGACTT ACGTAAGGAT AAACAAGCTT TGCAAAGATT GACTGAAGCT GCGGAGAAGG CAAAAATTGA GCTTTCTAGT GCAACTCAAA CTAATATCAA TTTGCCCTTT ATTACTGCTA CTCAGGCGGG TCCAAAACAC TTGGATATGA TGCTGACACG GGGTAAATTT GAGGAGATGA CAGCCGACCT TCTCGACCGT TGTCGTAAAC CAGTCCAACA AGCATTGCAA GATGCAAAAC TCAGTAATGC TCAACTTGAT GAAATTGTTT TAGTTGGTGG TTCGACTCGC ATTCCCGCTG TGCAAGAACT GGTGCGACGG ATGACGGGTA AAGAACCTTG TCAAGGTGTA AATCCTGATG AAGTTGTAGC GGTGGGTGCG GCTATTCAAG CGGGTGTTTT ATCCGGTGAA GTCAAAGATA TTTTACTGCT TGATGTTACG CCGTTGTCTT TGGGTGTGGA AACTATTGGC GGTGTGATGA CTAAGATTAT TAGCCGCAAT ACAACTATCC CGGTGAAGAA ATCAGAAGTC TTTTCTACGG CTGCTGATGG TCAAAGTAAT GTGGAAGTTC ACGTTTTGCA AGGTGAGAGG GAACTAGCTA AAGATAATAA GAGTTTAGGT ACTTTCCGTT TGGATGGTAT TCCTCCAGCA CCGAGAGGTG TACCGCAAAT TGACGTTACT TTTGACATTG ATGCTAACGG TATTCTTTCT GTTACTGCTA AGGATAAAGC CACGGGCAAA CAGCAGTCCA TTTCTATTAC AGGTGCTTCT ACTCTCGATA AGCGGGATGT AGAAAAGATG GTGCGGGATG CAGAATCTCA TGCGGAGGAA GATAGAAGAC GACGTGAACA AATTGATACT AAAAATTTGG GTGATTCTTT AGTTTATCAA GCTGAGAAAC AACTCAGAGA CTTGGGTGAT AAGGTGAGTG CTGTTGATAG AGGACGAGTT GAAGATTTGG TCAAGGATTT GGCGGAAGCT ATCAATCAAG ATCATTTCGA TCGGATTAAG TCTCTGAGCA GTCAATTACA GCAAGTGTTG ATGCAGGTTG GTAGTACGGT TTATGCACAA 
GCAGGAAGTT CTGATGGAAG TAGTAGGAGT GAAGATGTGA TTGATGCTGA CTTTGTAGAA AATAAATAA
|
Protein sequence | MAKVVGIDLG TTNSCVAVME GGQPLVIANS EGQRITPSVV AYTKTGERLV GQIARRQAVM NPENTFYSVK RFIGRKYEEI THEATEVSYR TLRDSNANVK LNCPAAGQEF APEEISAQVL RKLVDDASKY LGEKVTQAVI TVPAYFNDSQ RQATKDAGKI AGLDVLRIIN EPTAAALAYG LDKKENETIL VFDLGGGTFD VSILEVGDGV FEVKSTSGDT HLGGDDFDKK IVDWLANQFQ SNEGIDLRKD KQALQRLTEA AEKAKIELSS ATQTNINLPF ITATQAGPKH LDMMLTRGKF EEMTADLLDR CRKPVQQALQ DAKLSNAQLD EIVLVGGSTR IPAVQELVRR MTGKEPCQGV NPDEVVAVGA AIQAGVLSGE VKDILLLDVT PLSLGVETIG GVMTKIISRN TTIPVKKSEV FSTAADGQSN VEVHVLQGER ELAKDNKSLG TFRLDGIPPA PRGVPQIDVT FDIDANGILS VTAKDKATGK QQSISITGAS TLDKRDVEKM VRDAESHAEE DRRRREQIDT KNLGDSLVYQ AEKQLRDLGD KVSAVDRGRV EDLVKDLAEA INQDHFDRIK SLSSQLQQVL MQVGSTVYAQ AGSSDGSSRS EDVIDADFVE NK
|
| |
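The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the Strand field is "-"), and it uses the fact that translation table 11 assigns the same amino acids to codons as the standard code. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first 30 and last 9 bases of the record are used here.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; table 11 shares these
# assignments with the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 9 bases, copied from the Gene sequence field.
gene_start = "ATGGCAAAAG TAGTAGGAAT TGATTTAGGA"
gene_end = "AATAAATAA"

print(translate(gene_start))  # MAKVVGIDLG
print(translate(gene_end))    # NK
```

Both fragments match the corresponding ends of the listed protein sequence (MAKVVGIDLG at the start, NK at the end). Translating the full sequence the same way should yield the 622 residues shown, provided the assumptions above hold.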