Gene Information |
Locus tag | Cag_1796 |
Symbol | |
ID | 3747216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2316188 |
End bp | 2319331 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637774334 |
Product | ankyrin |
Protein accession | YP_380090 |
Protein GI | 78189752 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0210] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
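The coordinates and lengths in the gene table are mutually consistent, which a short sketch can confirm. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows for Cag_1796.
length = gene_length(2316188, 2319331)
print(length, protein_length(length))  # prints "3144 1047"
```

The printed values match the Gene Length (3144 bp) and Protein Length (1047 aa) rows.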
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00006002 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC TTGAATATAC AGGATTTGAT AGTAGCAGTG TGGCGGAAAG TTACCGCAAA GTAGCTACGG CGTTGGCGCA AGGCGATTTT AGAGCAGCAC AGGTAAAAAA GCTTGTTAAC CTAACACATG GCAAATTTTA CCGTGCCAAG CTTGATGCCG CAAATCGCTT GCTCTTCACC TTTGTGCGTT ACGGCGACGA GGTCTGCTTG CTTATGTTGG AAGTGATTAT GGGGCACAAC TACCACAAGT CGCGCTTTTT GCGGGGTGCA CCACTTGAGG AAGAAAAAAT CCCTGATGTT GATGCAAGTG AAGCACTTAA CGATGCTGAA CAGCTCCGCT ACCTTCACCC CAATCATACT GAAATTCACC TGCTTGATAA GCCTATCTCC TTTGATGATG CGCAGCAAGC AGTCTATTTG CACAAGCCGC CGCTCATTAT TGTGGGAAGT GCTGGAAGTG GTAAAACTGC CTTAATGCTC GAAAAGTTGA AGCATGTTGA GGGCGAAGTG CTCTATGTAA CGCACTCACA ATATTTGGCG CAAAACGCTC GTAACATTTA CTATGCGTAT GGTTTTGAGC ATCCTGCACA AGAGGCGCAC TTTCTTTCCT ATCGTGAATT TGTGGAGTCC ATTCGTGTGC CAACAGGACG TGAAGCAACA TGGCGCGATT TTGCAGCATG GTTTTATAGG ATGCGCAGCA ACTTTAAAGA GATTGATCCC CATCAAGCCT TTGAGGAGAT TCGGGGCGTT ATTACCGCGC CTGAAGATGG TTGCCTCAGT CGTAAGAATT ACTTGCAACT TGGCGTGCGC CAATCCATTT TTTCAAAAGA GCAACGTTCA ATACTGTATG ACCTCTTTCT CAAATATCGC CATTGGCTAA CGGATTCGGG TTTATTCGAC CTTAACCTGA TTGCGCACGA ATGGAAAGCC TCGCCTCGCT ACGATTTTGT GCTGATTGAT GAGGTGCAAG ATATGACGGT AGCCCAACTT TCGCTTGTTC TGAAAAGCTT AAAAAAGGCG GGACATTTTC TGTTATGTGG TGATTCTAAC CAAATTGTTC ACCCTAACTT TTTTGCATGG AGCCACGTTA AAACGCTGTT TTGGAAAGAT CCTAACCTTG CGGGAAAGAA GCAGTTACAG GTGCTTACGG CAAACTTCCG CAACGGACGC GAAGCAACGC GCATTGCGAA TCAACTGCTC AAACTCAAAC ATCAGCGCTT TGGCTCAATT GACCGTGAAA GCAATTTTTT AGTGGAAGCA ATTGGTGGCG CTGAAGGGCA AGCTCAGCTT ATGGCTGATA CCGATGCCAC AAAACGTGAA TTCAACAAAA AAATCAGCCA CTCCACGCGC TTTGCAGTTT TGGTGATGCG CGATGAAGAG AAGCAAGAGG CTCGCAAATA TTTTTCTACC CCATTACTTT TTTCTATCCA CGAAGCAAAA GGGCTTGAGT ACGACAACAT TGTGCTCTTC CGCTTTGTTT CATCCTGCCG CCGCGAATTT AACGACATTG CAGAAGGTGT TTCGCTTACC GATTTAGAGG CAATTGATTC GCTTGAGTAC TGCCGCGCCA AAGAAAAAGG CGATAAATCA CTCGAAGTCT ATAAGTTCTT TATTAACGCC CTTTACGTCG CCCTTACTCG TGCGGTAAAA AATCTTTACC TCATTGAATC CGACACCAAA CACCGCCTTT TTGAATTGCT GGGACTTGCT GTTGCTGGCA AGGTAGAGGT CGCCGCTGAG GAATCGTCGC TTGAGGAGTG GCAAAAAGAG GCACGCAAGC TTGAATTGCA AGGCAAACAA 
GAGCAAGCCG AAGCCATCCG CCGCGATATT TTAAAAGAGG TGCCACCCCC ATGGCAAGTG TGCAATGAAA CACGCTTAGA CGAATTGATC CATAAAGTGT TTAAAGAAAA AGCGCCAGGC AACAAATTCA AACAGCAACT TTACGAATAT GCCACATGCC ATGTAGAGCC AGTGCTTGCA CAAGCCCTTG AAAAGCAAAC CGACTATCGT TCACCGCACG GTTCATTTTG GGAACACCTT GATACCATTG GGCGCAAAAG TTATTTGCCA TACTTTAGCC AGCAAACCAA AGCCATTCTT CGCCAATGCG AACAACACGG GCCAAACCAT CGCTTGCCGA TGAACCAAAC GCCGCTCATG GCAGCCGCAG CCGCAGGCAA CATTGCATTA ACAGAAGCAT TGCTGGAACG TGGAGCCGAC CCAACCCTAA ACGATCACTA CGGCTACAAC GCCTTACATT GGGCAATGCG CCAAGCCTTT CGCGATAACC GTTTTGCACG CACAACGTTT GGAACCCTCT ATGAACGGCT TGCACCAGCG GCTGTGGACA TTAGTAGCGG TGAACGGATG ATACGGCTCG ATCGCCACTT GGCAGAATAT CTGCTCTTTC AAACCTGCTG GGTTCTCTTT AAAAGCCGTT TTACAACGCT TGAGCTCAAT GGCGAATATC CAGCATTTGA CACTTCGCTT ATTTTAGAAG CATGGGAACA TATGCCCGAC AACGTAGTGC CCACAGAACG CAAACGCCGC ACCTACCTCT CCAGTGTGCT TGCCCGCAAC GAAGTTTCAC GCAATTACGC TTACAACCGC TCGCTTTTTG AGCGCCTTGC AACAGGATGG TATCAATTCA ACCCAGCACT ACATGTACGC ACTTCGGTAA CAGAGGAGGG ACAATCTCCA TGGATTCCGA TTTTTCAAGC CGTAAACTTG CCCTTAATTA GTAAATTTTG CCATTCACAC ACCATTGCTA CCATCGTACA ATGCTTCCGC AAAGCATGCA TGGCAGTAAT ACCCGAATTG GAAGCGGAAA TTGCTCAGCA ACAAGCAACA AAAGCCGCAA AAGAGCAGCA CCTACAAACA CTTGTAAAAC AAGTTAAAAA AAAGATAACG CCATCATCAG ACTCTCTTGC CGCAAAACTT CTCAAACAAC ACAAATTGAG CAAAAAGTTA GATGATGAGC TGTTAGTGCC ATTTCTGAAG TTTGTTCGCG AAAAAGAGCT TGAGGAAATA AGGCAGCAGA AGATGAAAAA GAAGCTTGAA AGAGAGGAGC GGCAACAAAT AAAAGCGGCT GAACAAGCAA AACGTGATGA ACAAGTGCAA CAGCAACTTG GATTTGATTT TTAA
|
Protein sequence | MKILEYTGFD SSSVAESYRK VATALAQGDF RAAQVKKLVN LTHGKFYRAK LDAANRLLFT FVRYGDEVCL LMLEVIMGHN YHKSRFLRGA PLEEEKIPDV DASEALNDAE QLRYLHPNHT EIHLLDKPIS FDDAQQAVYL HKPPLIIVGS AGSGKTALML EKLKHVEGEV LYVTHSQYLA QNARNIYYAY GFEHPAQEAH FLSYREFVES IRVPTGREAT WRDFAAWFYR MRSNFKEIDP HQAFEEIRGV ITAPEDGCLS RKNYLQLGVR QSIFSKEQRS ILYDLFLKYR HWLTDSGLFD LNLIAHEWKA SPRYDFVLID EVQDMTVAQL SLVLKSLKKA GHFLLCGDSN QIVHPNFFAW SHVKTLFWKD PNLAGKKQLQ VLTANFRNGR EATRIANQLL KLKHQRFGSI DRESNFLVEA IGGAEGQAQL MADTDATKRE FNKKISHSTR FAVLVMRDEE KQEARKYFST PLLFSIHEAK GLEYDNIVLF RFVSSCRREF NDIAEGVSLT DLEAIDSLEY CRAKEKGDKS LEVYKFFINA LYVALTRAVK NLYLIESDTK HRLFELLGLA VAGKVEVAAE ESSLEEWQKE ARKLELQGKQ EQAEAIRRDI LKEVPPPWQV CNETRLDELI HKVFKEKAPG NKFKQQLYEY ATCHVEPVLA QALEKQTDYR SPHGSFWEHL DTIGRKSYLP YFSQQTKAIL RQCEQHGPNH RLPMNQTPLM AAAAAGNIAL TEALLERGAD PTLNDHYGYN ALHWAMRQAF RDNRFARTTF GTLYERLAPA AVDISSGERM IRLDRHLAEY LLFQTCWVLF KSRFTTLELN GEYPAFDTSL ILEAWEHMPD NVVPTERKRR TYLSSVLARN EVSRNYAYNR SLFERLATGW YQFNPALHVR TSVTEEGQSP WIPIFQAVNL PLISKFCHSH TIATIVQCFR KACMAVIPEL EAEIAQQQAT KAAKEQHLQT LVKQVKKKIT PSSDSLAAKL LKQHKLSKKL DDELLVPFLK FVREKELEEI RQQKMKKKLE REERQQIKAA EQAKRDEQVQ QQLGFDF
|
| |
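The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the pipeline that produced the record. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in which codons may act as starts), and it assumes the sequence is in frame and begins with ATG, as this one does. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAATTC TTGAATATAC AGGATTTGAT"))  # prints "MKILEYTGFD"
```

The output matches the first ten residues of the protein sequence. Translating the final 15 bases (`GGATTTGATTTTTAA`) likewise gives `GFDF`, the last four residues, before the TAA stop.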