Gene Cag_1796 details

Gene Information

Locus tag: Cag_1796
Symbol:
ID: 3747216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2316188
End bp: 2319331
Gene Length: 3144 bp
Protein Length: 1047 aa
Translation table: 11
GC content: 46%
IMG OID: 637774334
Product: ankyrin
Protein accession: YP_380090
Protein GI: 78189752
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases
TIGRFAM ID:
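
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions, not statements from the record, although the numbers are consistent with them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2316188
end_bp = 2319331
gene_length_bp = 3144
protein_length_aa = 1047

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should contain a whole number of codons.
codon_count, leftover = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span agrees with Gene Length
print(leftover == 0)                             # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions the three checks agree with the record: 3144 bp is 1048 codons, that is 1047 residues plus a stop codon.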


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00006002
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTC TTGAATATAC AGGATTTGAT AGTAGCAGTG TGGCGGAAAG TTACCGCAAA 
GTAGCTACGG CGTTGGCGCA AGGCGATTTT AGAGCAGCAC AGGTAAAAAA GCTTGTTAAC
CTAACACATG GCAAATTTTA CCGTGCCAAG CTTGATGCCG CAAATCGCTT GCTCTTCACC
TTTGTGCGTT ACGGCGACGA GGTCTGCTTG CTTATGTTGG AAGTGATTAT GGGGCACAAC
TACCACAAGT CGCGCTTTTT GCGGGGTGCA CCACTTGAGG AAGAAAAAAT CCCTGATGTT
GATGCAAGTG AAGCACTTAA CGATGCTGAA CAGCTCCGCT ACCTTCACCC CAATCATACT
GAAATTCACC TGCTTGATAA GCCTATCTCC TTTGATGATG CGCAGCAAGC AGTCTATTTG
CACAAGCCGC CGCTCATTAT TGTGGGAAGT GCTGGAAGTG GTAAAACTGC CTTAATGCTC
GAAAAGTTGA AGCATGTTGA GGGCGAAGTG CTCTATGTAA CGCACTCACA ATATTTGGCG
CAAAACGCTC GTAACATTTA CTATGCGTAT GGTTTTGAGC ATCCTGCACA AGAGGCGCAC
TTTCTTTCCT ATCGTGAATT TGTGGAGTCC ATTCGTGTGC CAACAGGACG TGAAGCAACA
TGGCGCGATT TTGCAGCATG GTTTTATAGG ATGCGCAGCA ACTTTAAAGA GATTGATCCC
CATCAAGCCT TTGAGGAGAT TCGGGGCGTT ATTACCGCGC CTGAAGATGG TTGCCTCAGT
CGTAAGAATT ACTTGCAACT TGGCGTGCGC CAATCCATTT TTTCAAAAGA GCAACGTTCA
ATACTGTATG ACCTCTTTCT CAAATATCGC CATTGGCTAA CGGATTCGGG TTTATTCGAC
CTTAACCTGA TTGCGCACGA ATGGAAAGCC TCGCCTCGCT ACGATTTTGT GCTGATTGAT
GAGGTGCAAG ATATGACGGT AGCCCAACTT TCGCTTGTTC TGAAAAGCTT AAAAAAGGCG
GGACATTTTC TGTTATGTGG TGATTCTAAC CAAATTGTTC ACCCTAACTT TTTTGCATGG
AGCCACGTTA AAACGCTGTT TTGGAAAGAT CCTAACCTTG CGGGAAAGAA GCAGTTACAG
GTGCTTACGG CAAACTTCCG CAACGGACGC GAAGCAACGC GCATTGCGAA TCAACTGCTC
AAACTCAAAC ATCAGCGCTT TGGCTCAATT GACCGTGAAA GCAATTTTTT AGTGGAAGCA
ATTGGTGGCG CTGAAGGGCA AGCTCAGCTT ATGGCTGATA CCGATGCCAC AAAACGTGAA
TTCAACAAAA AAATCAGCCA CTCCACGCGC TTTGCAGTTT TGGTGATGCG CGATGAAGAG
AAGCAAGAGG CTCGCAAATA TTTTTCTACC CCATTACTTT TTTCTATCCA CGAAGCAAAA
GGGCTTGAGT ACGACAACAT TGTGCTCTTC CGCTTTGTTT CATCCTGCCG CCGCGAATTT
AACGACATTG CAGAAGGTGT TTCGCTTACC GATTTAGAGG CAATTGATTC GCTTGAGTAC
TGCCGCGCCA AAGAAAAAGG CGATAAATCA CTCGAAGTCT ATAAGTTCTT TATTAACGCC
CTTTACGTCG CCCTTACTCG TGCGGTAAAA AATCTTTACC TCATTGAATC CGACACCAAA
CACCGCCTTT TTGAATTGCT GGGACTTGCT GTTGCTGGCA AGGTAGAGGT CGCCGCTGAG
GAATCGTCGC TTGAGGAGTG GCAAAAAGAG GCACGCAAGC TTGAATTGCA AGGCAAACAA
GAGCAAGCCG AAGCCATCCG CCGCGATATT TTAAAAGAGG TGCCACCCCC ATGGCAAGTG
TGCAATGAAA CACGCTTAGA CGAATTGATC CATAAAGTGT TTAAAGAAAA AGCGCCAGGC
AACAAATTCA AACAGCAACT TTACGAATAT GCCACATGCC ATGTAGAGCC AGTGCTTGCA
CAAGCCCTTG AAAAGCAAAC CGACTATCGT TCACCGCACG GTTCATTTTG GGAACACCTT
GATACCATTG GGCGCAAAAG TTATTTGCCA TACTTTAGCC AGCAAACCAA AGCCATTCTT
CGCCAATGCG AACAACACGG GCCAAACCAT CGCTTGCCGA TGAACCAAAC GCCGCTCATG
GCAGCCGCAG CCGCAGGCAA CATTGCATTA ACAGAAGCAT TGCTGGAACG TGGAGCCGAC
CCAACCCTAA ACGATCACTA CGGCTACAAC GCCTTACATT GGGCAATGCG CCAAGCCTTT
CGCGATAACC GTTTTGCACG CACAACGTTT GGAACCCTCT ATGAACGGCT TGCACCAGCG
GCTGTGGACA TTAGTAGCGG TGAACGGATG ATACGGCTCG ATCGCCACTT GGCAGAATAT
CTGCTCTTTC AAACCTGCTG GGTTCTCTTT AAAAGCCGTT TTACAACGCT TGAGCTCAAT
GGCGAATATC CAGCATTTGA CACTTCGCTT ATTTTAGAAG CATGGGAACA TATGCCCGAC
AACGTAGTGC CCACAGAACG CAAACGCCGC ACCTACCTCT CCAGTGTGCT TGCCCGCAAC
GAAGTTTCAC GCAATTACGC TTACAACCGC TCGCTTTTTG AGCGCCTTGC AACAGGATGG
TATCAATTCA ACCCAGCACT ACATGTACGC ACTTCGGTAA CAGAGGAGGG ACAATCTCCA
TGGATTCCGA TTTTTCAAGC CGTAAACTTG CCCTTAATTA GTAAATTTTG CCATTCACAC
ACCATTGCTA CCATCGTACA ATGCTTCCGC AAAGCATGCA TGGCAGTAAT ACCCGAATTG
GAAGCGGAAA TTGCTCAGCA ACAAGCAACA AAAGCCGCAA AAGAGCAGCA CCTACAAACA
CTTGTAAAAC AAGTTAAAAA AAAGATAACG CCATCATCAG ACTCTCTTGC CGCAAAACTT
CTCAAACAAC ACAAATTGAG CAAAAAGTTA GATGATGAGC TGTTAGTGCC ATTTCTGAAG
TTTGTTCGCG AAAAAGAGCT TGAGGAAATA AGGCAGCAGA AGATGAAAAA GAAGCTTGAA
AGAGAGGAGC GGCAACAAAT AAAAGCGGCT GAACAAGCAA AACGTGATGA ACAAGTGCAA
CAGCAACTTG GATTTGATTT TTAA
 
Protein sequence
MKILEYTGFD SSSVAESYRK VATALAQGDF RAAQVKKLVN LTHGKFYRAK LDAANRLLFT 
FVRYGDEVCL LMLEVIMGHN YHKSRFLRGA PLEEEKIPDV DASEALNDAE QLRYLHPNHT
EIHLLDKPIS FDDAQQAVYL HKPPLIIVGS AGSGKTALML EKLKHVEGEV LYVTHSQYLA
QNARNIYYAY GFEHPAQEAH FLSYREFVES IRVPTGREAT WRDFAAWFYR MRSNFKEIDP
HQAFEEIRGV ITAPEDGCLS RKNYLQLGVR QSIFSKEQRS ILYDLFLKYR HWLTDSGLFD
LNLIAHEWKA SPRYDFVLID EVQDMTVAQL SLVLKSLKKA GHFLLCGDSN QIVHPNFFAW
SHVKTLFWKD PNLAGKKQLQ VLTANFRNGR EATRIANQLL KLKHQRFGSI DRESNFLVEA
IGGAEGQAQL MADTDATKRE FNKKISHSTR FAVLVMRDEE KQEARKYFST PLLFSIHEAK
GLEYDNIVLF RFVSSCRREF NDIAEGVSLT DLEAIDSLEY CRAKEKGDKS LEVYKFFINA
LYVALTRAVK NLYLIESDTK HRLFELLGLA VAGKVEVAAE ESSLEEWQKE ARKLELQGKQ
EQAEAIRRDI LKEVPPPWQV CNETRLDELI HKVFKEKAPG NKFKQQLYEY ATCHVEPVLA
QALEKQTDYR SPHGSFWEHL DTIGRKSYLP YFSQQTKAIL RQCEQHGPNH RLPMNQTPLM
AAAAAGNIAL TEALLERGAD PTLNDHYGYN ALHWAMRQAF RDNRFARTTF GTLYERLAPA
AVDISSGERM IRLDRHLAEY LLFQTCWVLF KSRFTTLELN GEYPAFDTSL ILEAWEHMPD
NVVPTERKRR TYLSSVLARN EVSRNYAYNR SLFERLATGW YQFNPALHVR TSVTEEGQSP
WIPIFQAVNL PLISKFCHSH TIATIVQCFR KACMAVIPEL EAEIAQQQAT KAAKEQHLQT
LVKQVKKKIT PSSDSLAAKL LKQHKLSKKL DDELLVPFLK FVREKELEEI RQQKMKKKLE
REERQQIKAA EQAKRDEQVQ QQLGFDF
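
The sequences above are laid out in space-separated blocks of 10 characters. As a rough sketch of how they might be handled, the code below strips the layout, translates DNA codon by codon, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAATTC TTGAATATAC AGGATTTGAT AGTAGCAGTG TGGCGGAAAG TTACCGCAAA"
dna = clean(first_line)
print(translate(dna))  # MKILEYTGFDSSSVAESYRK
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and then `gc_fraction` would give a value to compare against the GC content field.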