Gene Cpha266_1602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1602 
Symbol 
ID4571125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1818591 
End bp1821833 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content50% 
IMG OID639766183 
Producthelicase domain-containing protein 
Protein accessionYP_912047 
Protein GI119357403 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.602122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGGAA ACCACACAGA CCTGACATTT TTCACCAATG ATGCGAACCA GACGCTGCTC 
GACAGGTTCA AAATCACGCT TTCCGATACG CAGCTTTTCG ATGTTTTGGT CGGCTACTTT
CGTGCAAGCG GCTTTTACCA GCTCTGCGAC AGCCTCGAAC CCATAGATAA AACGCGCATT
CTTGTCGGTC TCGGTATTGA CGAAGAAACC GGCCGTGCAA TCAGTGCCTG GCGTGAACAG
ACCACCATTG ATTTCGAATC CCATAAAACA GCCAAAGCCC AATTTCAGCA AACACTGATC
GAGGAAATCG AACACTCTGA AGAGACCGAT GAAAAGCTCG AACACGGCCT GAAAAAGTTC
ATAGCATTCC TGAAATCAGA ATGCACAGAC CCGGCAATCG ACCGCAACCG TGGCGGAAAT
GGCAGAAAAC TGGAAATCCG CGCCTTCCCA TCAAAAAACA TCCATGCCAA AGTCTATATC
GGCCGGTTCG CCCCTGACGA CCGTGATTTC GGTTTTGTCG TTACCGGTTC GAGTAACTTT
TCATATTCCG GTCTGGTTGC AAACCGCGAG TTCAATGTCG AACTTCGTCA GCGTCGCGAT
GTCGAATTCG CCCTCACTCA GTTTGAAGAA CTCTGGGCTC AATCGGTCGA TATTTCAGAG
GAATTCATTG ATGCCGTCCA GAAAAAAACC TGGATGAACG ACACCATCAC ACCTTACGAG
CTTTATCTCA AACTCATCTA CGAATACCTG CAGGAGGACA TCAACCTTCG GGACGACATT
CAGATATTTC TACCCGAAGG CTTCATGGCC CTGCAATATC AGCAGCAAGC TGTACAGCAG
GCAATCAAGA AGCTCAACGA ACACAACGGA GTCTTTCTGG CCGACGTCGT CGGTTTGGGA
AAAACCTTCA TTGCGGCCCA GCTTCTGCAG CAGCTCAAAG GGCGGATTAT AGTCATATGT
CCACCCGTAC TCAAAAGCTA CTGGGAATCG AGCCTTCACG ACTTCCGGGT TCCTGCCCGT
GTGGAATCAC TCGGCAAGCT CGATAAAGTC ATACGTTTCG GACTTGACCG TTTCGACTAC
GTTGTCATCG ATGAAGCCCA CCGGTTTCGG AACGAGAACA CCCGGTCCTA TGCAGACCTG
CTCGACATCT GCCGAGGCAA GAAAGTTATC CTGGTAACCG CTACGCCGCT CAATAACACT
ATCGACGACA TATTCTCTCA GCTCAAACTC TTTCAGGTTC CGAAAAACTC CACGATCCCC
GGTATTCCGA ACCTTGAGCG CTATTTCACA TCGTTACGTA AACACTTCAA CGGCCTTGAT
CGTACCGACC CTGCATACAA ACATGCTATC AAGGAAGTCT CGCAGGAAAT TCGCGAACGT
ATTCTCAAGC ATGTCATGGT ACGTCGTACC CGTACCGACG TCATAACATG GTTCAAAAAC
GATATAGAAA GTCAGGGCCT CTTTTTCCCC GAAGTTCAGG AGCCACGCCG CATTGTCTAC
ACATTTGAAG GTGAACTTGA AACCATTTTC AACCGGACGA TCGGCCTGCT GCGAGAGTTC
CATTATGCCC GTTATATACC CTTGCTCTAC TATACCGGCA GTCGGCAGCT TTCGGAATTC
GAACGGCAGC AGCAGCGCAA CGTTGGCGGC TTCATGCGAG GAATCCTTAT CAAACGCCTC
GAAAGCAGTT TTTATGCGTT CCGCAAGAGT GTTCGCCGTT TTATCGAGTC CTATGAGCGC
TTTCTACACA TGTACAATGG AGGAACCATA TACATCAGCC GCAATATCGA CGTCTACGAC
CTTCTCGACA GCGATGATTT CGAAACACTT GAACGCTACG TTGAAGAGGA AAAAGCGCAG
AAATACGCTT CAGAAGACTT CCGCAACGAC TTCATCACAG ACCTTCAACA CGACATGCAG
CTCCTGCGCC AGATCGAACA GCTCTGGCAG GATGTTACAG AAGATCCCAA GCTCGACGCA
TTCACTGACC GGCTTCGGAA TGATCCCGTT CTGAAAAAAC GTATGGTCGT TTTTACAGAA
TCGAAAGAAA CCGGCGACTA TCTGTTTGAA CAGCTCAACA GGCTTTGGCC CGGTAAGATT
ATGTTTTTCA GCAGCAAAGG CGGGCGCACA GGCGACCCAG CCAAAACGCA TGCCCCGGCA
CAAGCCCGCG ACCTTATCAA AACAGCATTC GACCCGAACG AAAAAAGCTG CGATGAGAAC
ATCCGCATTC TCATTGCCAC AGATGTTCTC TCCGAAGGAA TCAACCTGCA TAGGGCAAAT
GTTCTGATCG ACTACGACCT GCCATGGAAT CCCACCCGTG TGCTCCAGCG TGTCGGCAGG
GTTAATCGTC TCGGAACAAA ACATCCCGAC ATCTACATCT ATAATTTTTT CCCCACAACA
CAATCCGACT CGCATCTTCA GCTCGAAGCC AATATCACCA ACAAAATTCA GATGTTTCAC
GACATCCTCG GAGAAGACGC CAAATACCTT TCAGATGGTG AGGAAATTGG CAGCCAGGAA
CTTTTCGATA CACTGAACCG CAAGCAGGCC TATACCGGCG AAGATGAAGC CGGCGATTCC
GAACTACGCT ACCTTGAAAT GATGCGCAAC CTTCGGGACA AACACCCCGA TCTTTTCGAA
AAGATCAAAC GTCTGCCAAA AAAGGCGCGA TCCGGACGAA AGCTTTCCGG TCTCGATACC
GACCGTCTCG TCACCTTTTT CCGTATCGGT CAGCTGAAAA AGTTCTATCT CAACGAGGGC
GTTGAAAGCA GGGAAATAAC CTTCTTCGAT GCGGCCTCAC TGCTCGAATG TCAGCCTGAA
ACGCCCCGGC AGGCCATTCC TGCAGCCTAT TATCATTGGC TCGAAACCAA CAAACAGCGG
TTTGGGCTCG ATGCCATGCA GGAAGAAGCC CCATCAACCA CTTCAGGCGG TCGTTCTAAC
AGCAGCTATA TCGAAAGACG GTTGAAAGAA AGGGCATTCC GAACCTGCCA GAAATTTACC
GAACCCGATG AGGAGTTCAT CGACGGAGTG CTTCGCATGT TAACTCAGGG GCTGATTGCC
AAAAAGACCG CGCAGACGGT CAAGAAAGCC CTTGAAAAAA CCGACGATCC TCTCGATATG
CTCACTATTC TGCGCAAGCA CATCCGGAAA GAGAGCGAAC CCGTTTCTCA AACGGTCAAA
TCGTCGAGGG CTTGCCGGGA AATCATTCTT TCAGGCTATC AAGTTTCAGG AGACGGAGAG
TAG
 
Protein sequence
MSGNHTDLTF FTNDANQTLL DRFKITLSDT QLFDVLVGYF RASGFYQLCD SLEPIDKTRI 
LVGLGIDEET GRAISAWREQ TTIDFESHKT AKAQFQQTLI EEIEHSEETD EKLEHGLKKF
IAFLKSECTD PAIDRNRGGN GRKLEIRAFP SKNIHAKVYI GRFAPDDRDF GFVVTGSSNF
SYSGLVANRE FNVELRQRRD VEFALTQFEE LWAQSVDISE EFIDAVQKKT WMNDTITPYE
LYLKLIYEYL QEDINLRDDI QIFLPEGFMA LQYQQQAVQQ AIKKLNEHNG VFLADVVGLG
KTFIAAQLLQ QLKGRIIVIC PPVLKSYWES SLHDFRVPAR VESLGKLDKV IRFGLDRFDY
VVIDEAHRFR NENTRSYADL LDICRGKKVI LVTATPLNNT IDDIFSQLKL FQVPKNSTIP
GIPNLERYFT SLRKHFNGLD RTDPAYKHAI KEVSQEIRER ILKHVMVRRT RTDVITWFKN
DIESQGLFFP EVQEPRRIVY TFEGELETIF NRTIGLLREF HYARYIPLLY YTGSRQLSEF
ERQQQRNVGG FMRGILIKRL ESSFYAFRKS VRRFIESYER FLHMYNGGTI YISRNIDVYD
LLDSDDFETL ERYVEEEKAQ KYASEDFRND FITDLQHDMQ LLRQIEQLWQ DVTEDPKLDA
FTDRLRNDPV LKKRMVVFTE SKETGDYLFE QLNRLWPGKI MFFSSKGGRT GDPAKTHAPA
QARDLIKTAF DPNEKSCDEN IRILIATDVL SEGINLHRAN VLIDYDLPWN PTRVLQRVGR
VNRLGTKHPD IYIYNFFPTT QSDSHLQLEA NITNKIQMFH DILGEDAKYL SDGEEIGSQE
LFDTLNRKQA YTGEDEAGDS ELRYLEMMRN LRDKHPDLFE KIKRLPKKAR SGRKLSGLDT
DRLVTFFRIG QLKKFYLNEG VESREITFFD AASLLECQPE TPRQAIPAAY YHWLETNKQR
FGLDAMQEEA PSTTSGGRSN SSYIERRLKE RAFRTCQKFT EPDEEFIDGV LRMLTQGLIA
KKTAQTVKKA LEKTDDPLDM LTILRKHIRK ESEPVSQTVK SSRACREIIL SGYQVSGDGE