Gene Cpha266_1301 details


Gene Information

Locus tag: Cpha266_1301
Symbol:
ID: 4570841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1484498
End bp: 1487374
Gene Length: 2877 bp
Protein Length: 958 aa
Translation table: 11
GC content: 54%
IMG OID: 639765891
Product: helicase domain-containing protein
Protein accession: YP_911757
Protein GI: 119357113
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
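
The coordinate and length fields above are related by simple arithmetic, which the following Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1484498, 1487374)
print(length)                     # 2877, matching "Gene Length"
print(protein_length_aa(length))  # 958, matching "Protein Length"
```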


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATAA CCTTTCAACC AGGAAAACTG CTCTCCCTTC GAGGGCGCAA TTGGATTGTC 
ATGCCCTCCG ATGACCCTGA TCTTCTGGTC ATCAAGCCTC TTGGTGGCTC TGACGATGAG
ATTGCTGGCA TTTATCTCCC CCTTGCTATT CCCCGTGATG AGCCACAGGA GACCGAGTTT
GGCCAACCAA CCGGCAGTGA CCTTGGCGAT ATCAGTACCG CCCGACTGCT CTACGATTCA
GCCCGGTTGG CATTTCGAAA TGGCGCAGGC CCCTTTCGTG CACTGGCAAA GCTCTCTTTT
CGTCCACGCT CCTACCAAAT GGTTCCTTTG GTAATGGCGC TTCGGCAGGA GCTGATTCGC
CTCTTGATTG CTGATGATGT TGGCGTCGGT AAAACCATCG AAGCCCTCCT TATTGCCAAA
GAGATGCTGG AGCGCCGCAA AATCCAGCGC TTCGCGGTTG TCTGCCTCCC TCACCTCTGC
GAACAGTGGC AGCAGGAGAT TCGCGACAAA CTCGATATTG AGGCGGTTAT CATCCGTTCC
AACACTCAGG CACGCCTTGA CCGGCAGATA CAGGGCGATA CCAGCGTTTA CGATTACTAC
CCCTACCAGG TTATCAGTAT TGATTTCATA AAATCTGACA ATCGTCGCGA TGTTTTTGTG
CAGCAGTGCC CCGAACTGCT CATTGTTGAC GAAGCTCACA CCTGCGCCCG CCCGGCTGGA
GCCTCAAAAA GCCAGCAGCA GCGTTACCAT CTGCTGAGTC GTCTGGCAGG CAAGCCAGAG
CAGCAACTCA TTCTTCTCAC GGCCACTCCT CACTCCGGCA AGCCGGAAGA GTTTCACTCC
CTGCTCGGCT TACTCAAACC GGGGTTTGAA GTGCTTGATT TGCCGACGGC ATCACAGCCC
CAACGCAAAG AGCTGGCTCG TCATTTTGTG CAGCGCAAAC GTGGTGATGT TGAAAAGTGG
ATGGGTGAAG AGACTCCCTT CCCCAAGCGT GAAGCTATTG AGTGGGCTTA TGATCTCTCT
CGGCAGTATG AACTGTTTTT TGACCAGATC CTCGAATTCG CAAAAAAACT GATTGCCTCG
GACACTTCAA AAAAGGGCAC CCAACGGGTA CAGTACTGGA CGGCGCTTGC CCTGTTACGT
GGAGTAATGT CGAGCCCGGC TGCCGGTATT GAAATGCTCA ACACCCGACT CTCCAACCTT
GCCCATGTTG CCCTTGATGA AGGGTTGGCC GAAAACGGTG AAAACCCTGT GCAGGATAGT
GAGTTTGGCT TTGAGGGCGA CAACACACCG ACCTCCCTGC TTGAGCAAAC CGACTGGTCA
AGCTACCAGC GGCAACAACT GCGAGGATTT GCGGATCAAC TGGCAACCCT CTCTGACTTG
AAGCACGACC AGAAATGCGG GACAGCGGAA GCTATTCTTG AAGATTGGCT CTCGGCTGGC
TTCAACCCCG TTGTCTTTTG CCGCTACATC GCAACCGCAA ACTACGTCGG AAAGTTGCTG
GTTCCAGCCC TCCACAAAAA ATATCCCAAG CTTGATATTC AGGTTATCAC CAGTGAATTG
CCCGACGAAC TCCGCAAGCA GAAAATTGAT GAGATGGGCA AGGCAAAGCA CCGGCTCCTG
ATTGCAACCG ACTGCCTCAG CGAAGGCATC AACCTTCAGC AGCAGTTTAC GGCGGTGCTG
CATTACGATC TCCCCTGGAA TCCCAACCGC CTCGAACAGC GCGAAGGGCG GGTTGACCGT
TTTGGTCAGC CAGCGCCGGA AGTCAAAACC TGCCTTCTCT ATGGTGCGGA TAATCCTATC
GACGGTATTG TGCTTGATGT CTTGCTGCGC AAGGTTCGCG AAATTAAACG TTCCACCGGA
ATCAACGTCC CTTTTCCGGA AGATTCCCAG AGCATTATAG ATACCATCAC CAAGGCGCTT
TTGCTCAACC CCGATCGCAA AATCACCCGG CGGCGTGAAG GAAAGCAACA GTTGGCCTTC
GACTTCAGCG AGTTTGATGA AGCTCTCTCT GCCAAAGCCA CCATCTCCCG CAAAATAGAG
GAGGCCGCCG AGCGCGAAAA AGCGACCCGT ACCATCTTTG CCCAGAACGC CATCAAGGCC
GGCGAGATAG AGGCTGACCT TCGGGAGGTG GATGAGGCCA TTGGTGACCC CTTTGCCGTT
GAACAGTTCG TGACCAGCGC CCTCAACAAC CTCTTTGGCG TACAGGTTAT TGCCCAGCAG
AAAAAGGGAT GCTATCGCCT CATTACCGCC AACCTCCCCG ACCAGTTGCG CGGCATCCTG
CCCGTTGGCG AAATTGTCCA AATCAGCTTT CTCTCGCCAA CGCCTGAAGG CTATCACTAC
CTTGGGCGCA ACAACCGCTT TGTGGAACAA CTCTGCCAGC TCCTCATGGC CAACACCGTC
AACCGTTCCG GCAAGCGGGC CGCCCGTTCC GCCGTCATTC GCACCCGGCA GGTGGCCATC
AAAACAACCA TTCTCCTCTT CCGCTGCCGC AACGTCATCG AAGACCGGAG AGGGAGCCAG
CAGATTGTGG CCGAAGAGAT GATTCTCTGG GGCTGGAAAG GGACACCACA GGAGAAGGAG
TTCCTCGATC AGGCAGAGGC AAAAGCGCTC CTCGCGTCAG TTCGTGCCAG CTCCGATATG
TCGCTCCCTG CCAGAACAGG GTTTCTGGAG AATGAATTGA AACTGCTCAA ATCGTTTGAA
GCTGAATTTG ACCGCGTGGC TGAACAACAA TCAAAAAAAT TAGTCGAAGC CCATGAACGC
TTCAGTTCGT TGATGGACGG CAAGCGTTTT CAGGTGGTGC ATCCGGTACT GCCAATGGAT
CTGCTCGGCG TTTATATTCT TTTGCCGGAA GGCGAAGCTG CGGGAGGTTC CGCATGA
 
Protein sequence
MSITFQPGKL LSLRGRNWIV MPSDDPDLLV IKPLGGSDDE IAGIYLPLAI PRDEPQETEF 
GQPTGSDLGD ISTARLLYDS ARLAFRNGAG PFRALAKLSF RPRSYQMVPL VMALRQELIR
LLIADDVGVG KTIEALLIAK EMLERRKIQR FAVVCLPHLC EQWQQEIRDK LDIEAVIIRS
NTQARLDRQI QGDTSVYDYY PYQVISIDFI KSDNRRDVFV QQCPELLIVD EAHTCARPAG
ASKSQQQRYH LLSRLAGKPE QQLILLTATP HSGKPEEFHS LLGLLKPGFE VLDLPTASQP
QRKELARHFV QRKRGDVEKW MGEETPFPKR EAIEWAYDLS RQYELFFDQI LEFAKKLIAS
DTSKKGTQRV QYWTALALLR GVMSSPAAGI EMLNTRLSNL AHVALDEGLA ENGENPVQDS
EFGFEGDNTP TSLLEQTDWS SYQRQQLRGF ADQLATLSDL KHDQKCGTAE AILEDWLSAG
FNPVVFCRYI ATANYVGKLL VPALHKKYPK LDIQVITSEL PDELRKQKID EMGKAKHRLL
IATDCLSEGI NLQQQFTAVL HYDLPWNPNR LEQREGRVDR FGQPAPEVKT CLLYGADNPI
DGIVLDVLLR KVREIKRSTG INVPFPEDSQ SIIDTITKAL LLNPDRKITR RREGKQQLAF
DFSEFDEALS AKATISRKIE EAAEREKATR TIFAQNAIKA GEIEADLREV DEAIGDPFAV
EQFVTSALNN LFGVQVIAQQ KKGCYRLITA NLPDQLRGIL PVGEIVQISF LSPTPEGYHY
LGRNNRFVEQ LCQLLMANTV NRSGKRAARS AVIRTRQVAI KTTILLFRCR NVIEDRRGSQ
QIVAEEMILW GWKGTPQEKE FLDQAEAKAL LASVRASSDM SLPARTGFLE NELKLLKSFE
AEFDRVAEQQ SKKLVEAHER FSSLMDGKRF QVVHPVLPMD LLGVYILLPE GEAAGGSA
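
The sequence blocks above are printed in groups of ten characters, so they need whitespace removed before any computation. The sketch below shows one way to do that and then compute GC content and a translation. It is a minimal illustration, not the database's own method: the helper names are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares with the standard code; table 11 differs in which codons may act as starts), and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases cycle T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq_text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two groups of the gene sequence listed above.
print(translate("ATGAGTATAA CCTTTCAACC"))  # MSITFQ
```

Passing the full gene sequence text to `gc_percent` and `translate` would allow a comparison against the listed GC content and protein sequence.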