Gene Cag_1523 details


Gene Information

Locus tag: Cag_1523
Symbol:
ID: 3747157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1999556
End bp: 2002366
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 44%
IMG OID: 637774063
Product: DEAD/DEAH box helicase-like
Protein accession: YP_379821
Protein GI: 78189483
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00756] pentatricopeptide repeat domain (PPR motif)
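
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record. The function names are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


gene_len = feature_length(1999556, 2002366)
print(gene_len)                  # 2811
print(protein_length(gene_len))  # 936
```

Both values match the Gene Length and Protein Length fields in the record.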


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACTATAT CCGAAGCGCA AACTCGTTCC CAGCTTATTA ATAAGCTGCT TGCCCAATCA 
GGGTGGAATG TTAACGACCA AACGCAAGTT GTTGCGGAAT TTGACATTGC AATATCGCAC
ACCCAGCACA TAGCGGAACC ACTCACCCCA TACCACAGTC ATCAGTTTAG CGATTACGTT
TTACTCGGTA AAGATGGTAA GCCCTTAGCC GTTATTGAAG CTAAAAAGAC AAGTAAGAAT
GCAGCTTTAG GGCGCGAACA AGCCAAGCAA TACTGTTACC ATGTGCAGCG TCAGCAAGGT
GGAGTGCTTC CATTTTGCTT TTACACCAAT GGGCTTGAAA CATACTTTTG GGATTTAGAG
AATTACCCTC CACGCAAAGT GGTAGGTTTT CCAACCCGCG ATGATCTTGA ACGATTTCAC
TATATCCATC GCAACAAAAA GCCGTTGACG CAAGAGCTGA TTAATACCGC TATTGCTGGT
AGAGATTACC AAATACGGGC TATTCGGGCA GTACTTGAAG GCATCGAACA AAAAAGGCGA
GATTTTCTCT TGGTTATGGC AACGGGCACA GGCAAAACCC GTACTTGTAT TGCGTTAGTG
GATGCGCTTA TGCGTGCTGG TCATGCTGAA AAAGTGCTCT TTTTGGTTGA TCGTATTGCG
TTACGTGAGC AGGCGTTAGA TGCGTTTAAA GAGCATTTAC CTAATGAACC TCGCTGGCCT
AATAAGGAGG AAACTCTTAT TGCTAAAGAT CGCCGCATTT ACGTTGCCAC CTATCCAACA
ATGCTGAACA TCATCAGGGA TGAAGCGCAG CCTCTTTCGC CGCACTTTTT TGATTTCATC
GTAGTTGATG AAAGCCATCG CTCCATTTAC AACACCTATG GCGAAGTTCT TGATTATTTT
AAAACGCTCA CGCTTGGATT AACGGCTACA CCTACCAACG TTATTGATCA CAACACCTTC
CAGCTTTTTC ATTGCGAAGA TGGGCTTCCA TCCTTTGCCT ATACCTATGA AGAGGCTGTA
AATAATGTGC CGCCTTACTT GTGCAATTTT CAGGTTATGA AAATTCAGAC CCGCTTTCAG
ATGGAGGGCA TTAGCAAGCG TACCATTTCG CTTGACGATC AAAAAAAGCT GATGCTTGAA
GGCAAGGAGG TTGAAGAAAT CAACTTTGAA GGTACGCAGC TTGAAAAGCA AGTAACCAAC
AAAGGCACCA ACACACTCAT TGTGAAGGAG TTTATGGAGG AGTGCATCAA GGATCAACAT
GGCGTATTGC CTGGAAAAAC CATCTTTTTT TGCTCTTCCA CAAAACATGC TCGGCGTATT
GAAGAAATTT TTAACGCTCT TTATCCCGAA TACAAAGGTG AACTTGCTAA AGTGCTGGTT
TCTGATGATT CCCGTGTTTA TGGTAAGGGT GGATTGCTTG ACCAGTTTAA AACCAACGAT
ATGCCTCGCA TTGCCATTAG CGTTGACATG CTCGATACGG GCATTGATGT GCGCGAAATT
GTCAACCTTG TGTTTGCTAA ACCTGTTTAC TCATACACCA AGTTTTGGCA AATGATTGGG
CGCGGCACTC GTTTGTTAGA AACCAGCAAA CCCAAACCTT GGTGCACCGC AAAAGATGTT
TTTCTCATCC TCGATTGTTG GGACAACTTT GAATACTTCA AGTTGAATCC CAAAGGCAAA
GAGCTACCAT CGCAACTGCC ATTGCCCGTG CGCTTTGTTG GCTTACGGAT TGATAAAATT
GAAGCTGCCA TTGATCGCAA CCGTGTAGAA ATTGCTGAAC GCGAAATAAG CAAGCTACGT
GCCCAAATTG CCCAACTACC TCAAAACTCT GTGGTTATAA AAGAGGCTGC AACTGCATTA
GCGCAAATTG AAGCAGAACA TTTCTGGGAC TTGCTTAATC ATCAAACCTT AGAATTTTTA
CGCACTGAAA TTAAGCCGCT CTTCCGCACT CTTTCGGATG TTGATTTTAA AGCCATGCGC
TTTGAGCGCG ACTTGCTGGA ATACTCCTTA GCTGCTTTGC GTGAGGAAAA AGAAAAAGCC
GAAACCCTGA AGGAAGCTAT TGTTGAACAA ATCAGCGAGT TGCCACTTTC AATTCCTTTT
GTTAAGGCTG AAGAGGAGTT AATTCGTGCA GCCCAAACCA ACTATTATTG GGCAAAAGAT
GATGCGATTG CACTGGAAGA GACGCTGGAC AAGCTCAATA GTCGGCTTGG CGGATTAATG
CAATTCCGCG AGCAAACCGA AGAGAGAGAA ACGGTACACC TTGATTTACG TGATGAAATT
CATCGCAAAG AGATGGTTGA GTTTGGTCCG CAGCATGAAT CGGTAAGCAT TAGCCGCTAT
CGTGAAATGG TTGAGGGTAT GATTGCCGAA TTAACGGAGC ACAATCCCAT TTTGCAAAAA
ATAAAGATGG GCGAAAAGAT TTCCGCAATT GAAGCCGATG AGCTTGCCGC AATGCTCCAC
GCCGAACATC CGCACATTAC CGAAGAGTTG CTACAGCAAG TGTATAACAA TCGCAAGGCG
CATTTCATCC AATTTATTCG GCACATTCTT GGCATTGAGC AATTAAAAAG CTTTCCTGAA
ACCGTGAGTG AAGCCTTTGA ACAATTTATT CAACAGCACA GCAACCTCTC AAGCCGTCAA
TTGGAGTTTC TTAATTTGCT GAAGGGCTTC ATTATTGAAC GTGAAAAGGT TGAGAAGAAA
GACCTTATCA ATGCTCCATT TACGGTGATT CATCCGCAAG GCATTCGTGG AGTTTTCAAA
CCTTCCGAAA TCAATGAAAT ACTGAAATTA ACCGAGCAAC TTGCGGCTTA A
 
Protein sequence
MTISEAQTRS QLINKLLAQS GWNVNDQTQV VAEFDIAISH TQHIAEPLTP YHSHQFSDYV 
LLGKDGKPLA VIEAKKTSKN AALGREQAKQ YCYHVQRQQG GVLPFCFYTN GLETYFWDLE
NYPPRKVVGF PTRDDLERFH YIHRNKKPLT QELINTAIAG RDYQIRAIRA VLEGIEQKRR
DFLLVMATGT GKTRTCIALV DALMRAGHAE KVLFLVDRIA LREQALDAFK EHLPNEPRWP
NKEETLIAKD RRIYVATYPT MLNIIRDEAQ PLSPHFFDFI VVDESHRSIY NTYGEVLDYF
KTLTLGLTAT PTNVIDHNTF QLFHCEDGLP SFAYTYEEAV NNVPPYLCNF QVMKIQTRFQ
MEGISKRTIS LDDQKKLMLE GKEVEEINFE GTQLEKQVTN KGTNTLIVKE FMEECIKDQH
GVLPGKTIFF CSSTKHARRI EEIFNALYPE YKGELAKVLV SDDSRVYGKG GLLDQFKTND
MPRIAISVDM LDTGIDVREI VNLVFAKPVY SYTKFWQMIG RGTRLLETSK PKPWCTAKDV
FLILDCWDNF EYFKLNPKGK ELPSQLPLPV RFVGLRIDKI EAAIDRNRVE IAEREISKLR
AQIAQLPQNS VVIKEAATAL AQIEAEHFWD LLNHQTLEFL RTEIKPLFRT LSDVDFKAMR
FERDLLEYSL AALREEKEKA ETLKEAIVEQ ISELPLSIPF VKAEEELIRA AQTNYYWAKD
DAIALEETLD KLNSRLGGLM QFREQTEERE TVHLDLRDEI HRKEMVEFGP QHESVSISRY
REMVEGMIAE LTEHNPILQK IKMGEKISAI EADELAAMLH AEHPHITEEL LQQVYNNRKA
HFIQFIRHIL GIEQLKSFPE TVSEAFEQFI QQHSNLSSRQ LEFLNLLKGF IIEREKVEKK
DLINAPFTVI HPQGIRGVFK PSEINEILKL TEQLAA
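
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the pipeline the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the sequence is given on the coding strand in the 10-base blocks shown above. The `translate` function is a hypothetical helper.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks between blocks
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGACTATAT CCGAAGCGCA AACTCGTTCC CAGCTTATTA ATAAGCTGCT TGCCCAATCA"
print(translate(first_line))  # MTISEAQTRSQLINKLLAQS
```

The output for the first line of the gene sequence matches the first 20 residues of the protein sequence (MTISEAQTRS QLINKLLAQS).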