Gene Cpha266_1953 details


Gene Information

Locus tag: Cpha266_1953
Symbol:
ID: 4570128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2262004
End bp: 2264907
Gene Length: 2904 bp
Protein Length: 967 aa
Translation table: 11
GC content: 52%
IMG OID: 639766535
Product: helicase domain-containing protein
Protein accession: YP_912391
Protein GI: 119357747
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
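
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2262004
end_bp = 2264907
gene_length_bp = 2904
protein_length_aa = 967

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which is not translated.
codons = span_bp // 3
translated_aa = codons - 1

print(span_bp == gene_length_bp)           # True
print(span_bp % 3 == 0)                    # True
print(translated_aa == protein_length_aa)  # True
```

Under these assumptions, 2904 bp corresponds to 968 codons, one of which is the stop codon, which agrees with the listed protein length of 967 aa.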


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAA CCCTCACCAG CTATCACGCG CAATACTATG CATACGAACT CACGCGAAGT 
CGTACCGCTG ACGACCAGGA CAAATTCACC GCATCCCTGC AAGACGCCAC AGTTGACCTC
AACCCCCACC AGGTAGAAGC CGCACTCTTT GCCTTCAAAT CACCGCTCTC CAAAGGCGCA
CTGCTTGCAG ACGAAGTAGG CCTCGGCAAA ACCATAGAAG CCGGCATCAT CCTCTCCCAG
AAATGGGCGG AACGCAAACG CAAGCTTCTC ATTATAGCCC CGGCAAACCT CCGGAAACAA
TGGAATCAGG AACTTGCCGA CAAGTTTTTT CTGCCATCAA CCATCATTGA AGCAAAATCC
TTCAATCAGA TTATCAGAAG TGGCAACCTG AACCCATTTG AACAGAATGA AATTCTCATC
TGCTCATACC AGTTTGCTCG CGCTAAAGAA ACCTGGCTCC GGCATATACA ATGGGACCTT
GTAATCATTG ATGAAGCCCA CCGCCTCAGG AACGTCTATC GCCCCGACAA CAAAATCGGT
AAATCCATCA AGTCGGCTCT TGCCCACACC CAAAAAGTCC TGCTCACCGC CACCCCTTTG
CAGAACTCCC TGCTCGAACT CTACGGACTC GTCAGCATCA TAGACGACTA CACCTTCGGC
GACCTCAAAA GCTTCAAAAC CAACTACGCC CGCATCGGCG GCAACGAACA ATACAACGAC
CTCAAAGAGC GCCTCAAACC GGTCTGCAAA CGCACCCTCC GCCGTCAGGT ACTCGAATAC
ATAAGCTTCA CCAACCGGCT GGCAATTGTC GAAGAATTCT ACCCCTACCA AGAAGAACAG
CAGCTCTACG ACGACGTCTC CGACTACCTC AGAAGCGACA AACTCTATGC GCTCCCCGCA
AGCCAGCGCC ACCTCATGAC CATGATTTTG AGGAAACTCC TCTCCTCATC AACCTTTGCC
ATACAAGGAA CACTCCAGAA ACTCGCCACC AAACTCGACG CGCACCTCCA AGGCCAAACC
GCCATCCAGC TCGACGACAT AGCCCAAGAC TACGAAGGCT TTGAAGAACT CGCCGACGAA
TGGGCCGAAA ATGACGAAAC AAAGCGCAAC AAACCCGAAC CCATACCCGA AGAAGAACGC
CCGCAAGCGC TCGAAGAAAA ACAAAAACTC CAGCAATTCG CCGCACTTGC CAGCACCATC
CAGAAAAACT CCAAAGGCGT TAAACTCCTT ACAGCGCTCA CCAAAGGCTT CGAAAAACTG
CAGGAACTCG GTGCAGCAAA AAAAGCCATC ATCTTCACCG AATCGACCCG AACCCAGCAA
TACCTCAAAG AAATCCTCGA AACCCAGGGC TATGCCGGTC AGATCGTCCT CTTCAACGGC
ACCAACAACG ATACGCACTC CCGCGCCATC TATACAGACT GGATGCAGCA CCATGCCGGC
ACCGACCGCA TCAGCGGCTC ACACAGCGCC GACAAACGCC AGGCAATCGT TGACTGGTTC
CGCAACGAAG CCACCATCAT GATCGCCACC GAAGCCGCAG CCGAAGGCAT CAACCTCCAA
TTCTGCTCGC TCGTCGTCAA CTACGACCTC CCCTGGAACC CCCAGCGCAT AGAACAGCGC
ATTGGTCGCT GCCACCGCTA CGGTCAGAAA TTCGACGTCG TCGTCATCAA CTTCCTCAAC
AAAGCCAACG CGGCCGACCA GAGAGTCTAC CAGTTGCTCG ATCAGAAATT CAAACTCTTC
AGCGGAGTAT TCGGTGCAAG CGACGAAGTG CTCGGAGCCA TTGAAAGCGG AGTTGACTTC
GAAAAACGCA TTGCCAGAAT TTACAAAGAG TGCCGAACCG CACAGGAAAT TCAGGAAGCA
TTCGACGCAC TCGAAGCAAC CTTTGAAGAA GAGAAACAGC AGAAACTCGA CAACACAAAA
CTGCAACTGC TCGAAAACTT CGACGACGAA GTTCACCGGA AACTCCGAGT CAACCTCCAG
CAAGGCAAAG AATATCTCAA TCTTTTCGAG CAACGCCTTT GGGGCATAAC AGAATGGGCG
CTCAACAGCA GCGCCGACTT CCATCCCGAA ACATACTCAT TCACCCTCAA AGCAAATCCA
TTTCAGAACC CTGCTATTCA CCTTGGGAAC TACCAGCTTC TGAAATCCGC AACCGATCGC
AAAAAATCGG AAATCGACCT TGCCGCAACC GCCAACATCT ACCGCATAGG CCACCCCCTT
GCCCAATCCG TCATCGAAAG CTGCAAAACC CGCAACCCCG GCTCAACGCA CCTGCTCTTC
GACTACGCCA ACACACTGCT GAAAATAACG GTACTCGAAC CATTGCTTGG CCACTCGGGG
TGGCTGACGC TCTCGCTCCT GAGCATCAGC TCATTCGAAA AAGAAGAACA TCTCATCTTC
TCAGGCACAA CCGACGACGG AACGCCTCTC GATGGCGAAC TCTGCCAGAA ACTCTTCAAC
CTCTCCGCAA AAGAAATGGA GACGGTAACC ATACCAAAAG AGACAACAGC AACCCTCAAC
GCCATCAGAA CACAGCAGAT ACAGGCAATA CTCGATAACT CCATGCAGCG CAACGCCCGT
TTCTTCGACG ACGAATACGA AAAACTCGAC AAATGGGCTG ACGACATGAA GCTCAGTCTT
GAAAGGGAAA TCAAAGACCT CGATGCAGGA ATCAGGCTCC GCAAGGCAGA AGCCCGAAAA
CTCTCCGATC TCGAAAGCAA AGTAAAAGAA CGCCGCCATG TCAAGGAACT CGAAAAACTT
CGCGACGACA AACGGCGCCA CCTTTTTGAA GCGCAAGACC AGATAGAAAG CAAAAAAGAC
GGATTACTCG AAGATGTTGA AGCAAGAATG GCGAGCCGCA CAGAAAAAGA AACGCTCTTT
ACAATACGAT GGTCATTAAA ATAA
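
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_content`, not part of the source database) strip the whitespace and compute a GC percentage. Only the first line of the sequence is used here, so the result will not necessarily match the 52% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace and convert to upper case."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above (six blocks of ten bases).
first_line = "ATGAACACAA CCCTCACCAG CTATCACGCG CAATACTATG CATACGAACT CACGCGAAGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full gene sequence through `clean_sequence` in the same way should give the figure to compare against the GC content field.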
 
Protein sequence
MNTTLTSYHA QYYAYELTRS RTADDQDKFT ASLQDATVDL NPHQVEAALF AFKSPLSKGA 
LLADEVGLGK TIEAGIILSQ KWAERKRKLL IIAPANLRKQ WNQELADKFF LPSTIIEAKS
FNQIIRSGNL NPFEQNEILI CSYQFARAKE TWLRHIQWDL VIIDEAHRLR NVYRPDNKIG
KSIKSALAHT QKVLLTATPL QNSLLELYGL VSIIDDYTFG DLKSFKTNYA RIGGNEQYND
LKERLKPVCK RTLRRQVLEY ISFTNRLAIV EEFYPYQEEQ QLYDDVSDYL RSDKLYALPA
SQRHLMTMIL RKLLSSSTFA IQGTLQKLAT KLDAHLQGQT AIQLDDIAQD YEGFEELADE
WAENDETKRN KPEPIPEEER PQALEEKQKL QQFAALASTI QKNSKGVKLL TALTKGFEKL
QELGAAKKAI IFTESTRTQQ YLKEILETQG YAGQIVLFNG TNNDTHSRAI YTDWMQHHAG
TDRISGSHSA DKRQAIVDWF RNEATIMIAT EAAAEGINLQ FCSLVVNYDL PWNPQRIEQR
IGRCHRYGQK FDVVVINFLN KANAADQRVY QLLDQKFKLF SGVFGASDEV LGAIESGVDF
EKRIARIYKE CRTAQEIQEA FDALEATFEE EKQQKLDNTK LQLLENFDDE VHRKLRVNLQ
QGKEYLNLFE QRLWGITEWA LNSSADFHPE TYSFTLKANP FQNPAIHLGN YQLLKSATDR
KKSEIDLAAT ANIYRIGHPL AQSVIESCKT RNPGSTHLLF DYANTLLKIT VLEPLLGHSG
WLTLSLLSIS SFEKEEHLIF SGTTDDGTPL DGELCQKLFN LSAKEMETVT IPKETTATLN
AIRTQQIQAI LDNSMQRNAR FFDDEYEKLD KWADDMKLSL EREIKDLDAG IRLRKAEARK
LSDLESKVKE RRHVKELEKL RDDKRRHLFE AQDQIESKKD GLLEDVEARM ASRTEKETLF
TIRWSLK
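
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand in the usual NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons are allowed as starts, which this example ignores because the gene begins with ATG. The `translate` function is an illustrative helper, not the method used by the source database.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; each full line holds 60 bases,
# so every line starts in frame.
print(translate("ATGAACACAA CCCTCACCAG CTATCACGCG CAATACTATG CATACGAACT CACGCGAAGT"))
# MNTTLTSYHAQYYAYELTRS
print(translate("ACAATACGAT GGTCATTAAA ATAA"))
# TIRWSLK
```

Both outputs match the corresponding start and end of the protein sequence listed above.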