Gene Information |
Locus tag | Cpha266_1235 |
Symbol | |
ID | 4570253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1400533 |
End bp | 1403556 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639765826 |
Product | SNF2-related protein |
Protein accession | YP_911692 |
Protein GI | 119357048 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
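
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1400533
END_BP = 1403556
GENE_LENGTH_BP = 3024
PROTEIN_LENGTH_AA = 1007


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, both reported lengths agree with the coordinates: 1403556 - 1400533 + 1 = 3024 bp, and 3024 / 3 - 1 = 1007 aa.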
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.224345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCCT TGCATTGTTC GGTTCTGGAC GGAATACCGA TGCTCTGGAG TGAGGGCCGT GTTCCGGGCG ATATCAAAGA ACTTCGCCGT GCGCTCAAGG CAATCGGAGT GACCTTTACG ATTCGTAAAC CGATGACGAG AGAGGTTGTC GCGTGGCTTC CTTGCCGGGG GGAGGAGCGG GTGCCTTCTT CGCCGTTGAT CGGTGATGAG CCTGATAAAC GGCGAAAATC AACTCTTAAG CCCTTTACGA TTACCGCGCT ACCTCTTGAG ATTTCCTTGA GCCTTGCATT GTTCGCTTGC GCCCGTGGAG GCAATATTCC TGGTACAGGT GCTTTTTTCG GAACATCTTT TTTATGGACA AAGCATCTTT TTGAAGCCGC CGTGCATCTT GTTTCATCTC AGATGTTCCT GCCTTCGCTT ATCCGGAGCG GGCAGCAGCG GGAAGGTCGA TGGGTTCCCT ATCCTGATAA TGCCGCATCG CTGCATCTGA ACTTGCTTCT TGAAACCATG CCTCCGGTAT GCCGTTGTTT GTCAAGTGAC GGAAACTCCC TGCCGAAATC CTCACGGAAA GAGGTGCAGG AACAACTGCT CTTTCTGACG GTTGATTCGC TTGTACGGTT TTTATCAGCC ACGACAGTTC GCAACAACGG AAAACCGGCG ACTGTTCATG ACGCATGGAT GCATGCAATT ACCGGTACCG ATCCGGCAAT GAGATGGGAG AACAAGTCAG AGGTTGAAGT ATTTGCGCAT GAACTTGAGA AGTGGCGACG CCCTCTGGAT GTTTTCGCCC GTTCTCCTTT CAGTTTCTGC TTTCGGCTTG CCGAACCGGA ACAGAACGGC AGAAAAAAAG ATGTCTGGCA GCTGGTCTTT CTGCTGCAAC TGAAGAGTGA TCAGAGCTTG CTGCTTGAAG TCGGAGCACT CTGGGATCCT GAAAGCAGTG CGTCGCTACA GGTGAAAAGC TATGGCGGTG AGTGCAGGGA GTTCATGCTT ACGGCCCTCG GACAGGCAGC AGCTCTCTAC CCGGAACTCT CTGCCGGATT GAAGCTGAAA AAACCGGGAG CGTTGAAGCT CGATACCGAT GGGGCATTCC GGTTTCTTGC CGGGTATGCC GAACTTCTGC AGAGAGCCGG ATTTGTGGTT ATGCTGCCGT CATGGTGGAT AGGCCGGGGG CCGGTGAACA GAATGGGCAT CAAGGTGAAC GTTAAAGCTC CTGCCATGCA ATCAAGCGGC AGCAAGTCAG GGCTCGATAC ACTGCTTTCG TGTGATTATG CCGCATCGCT CGGCAACGAT GAGTTTGATC TGGATGAGCT TCGCCGTCTT GCGGAGCTGA AAATGCCGCT GGTCAGGGTA AGAGGGCAGT GGAGGCAGAT TGAGCAGCGT GAGCTTGCTG ATGCGTTGCG TTTTCTTGAA AAACGGCAAT CCGGCGAGCT TTCTGTACGG GATATTCTTG CAGCGGCTTT TGGCGCCGGG CCAAAAGAGA CGGCTTTGGT TCATCGAACG GTTGATGCCG ATGGATGGAT GAAGGATCTT CTCGATAAGT TGAAAGGATA TACCCGTTTT GAGCTGCTCT CACAACCGGA TCGTTTTGAG GGAAAACTTC GCGAATATCA GGTGAGGGGT TTTTCATGGC TTGCATTTCT GAGAACCTGG GGGCTTGGCG CATGTCTTGC TGATGATATG GGTCTTGGTA AAACGATACA GACGCTCGCC CTGCTGCAAC AGGAGCGGAA CCTGGGCGAA AAACGTCCGG TTCTGTTAAT CTGTCCGACC TCGGTTGTCA ACAACTGGAG AAAGGAGGCG 
GAACAGTTTA CGCCGGATCT GGCTGTGCTG GTGCACCATG GCTCCGATCG GCTGAAAACC GCTGCGTTCA GGAGAGCAGC CGCTAAATCG GCACTGGTGA TTTCAAGTTA TGGCCTTTTG TTGCGCGATA TAGCGTCTTT ATCAAAGCAG CAATGGGCGG GGGTGATTCT TGACGAAGCG CAGAACATCA AGAATCCTGA AACGAAACAG GCAAAAGCTG CCCGAACACT GCAGTCAGAT TACCGGATAG CGCTTACCGG TACGCCGGTT GAAAATCATG TTGGCGATCT TTGGGCTTTG ATGGATTTTC TCAACCCCGG TTTTCTTGGC GGTCAGGCGT TTTTCAAGGA GCACTTTTAT TATCCGATCC AGTGGAATGG CGACATCGAC GCATCGGAAC GGCTCAAAGC CATAACTGCG CCGTTTATTC TTCGCCGTCT CAAAACCGAT ACATCGATTA TCTCCGATCT GCCCGACAAA ATAGAAATGA AGCAGTACTG TACCCTCACC AGAGAGCAGG CCTCACTCTA CAAGGCGGTT ATTGATGAAT TGCAGGAGAA AATCGAAACG GCGGAGGGTA TCGATCGGCG AGGGCTTGTG TTGGCGCTGC TGGTGAAGCT TAAGCAGGTT TGTAATCATC CGGTTCAGTT TCTCGGAGAT AATTCGTCTG TTGAGCATCG TTCCGGAAAG TTGCAGCGTC TGACGGAGTT GCTGTCGGAA ATCCGTGAAT GCGGGCAACG GACTCTTGTT TTTACACAGT TTATGGAAAT GGGAAAGATT TTGCAGCGCT ATCTTCAGGA ACTGTTCGGT GAAGAGGTGT TTTTTCTGCA CGGTTCTCTC TCCAGAAAAA AGCGTGACGC GATGATCGAT GCTTTCCAGC AGGGTGAACA TGCGCCGCAT ATTTTCATTC TTTCGCTGAA AGCCGGGGGA TCCTGTTTAA ACCTGACCAA CGCAAACCAT GTTGTGCATT ACGATCGATG GTGGAACCCG GCGGTTGAAA ATCAGGCGAC GGACAGGGCT TTCCGTATCG GGCAGAAACG GAATGTGGAA GTCCATAAGT TCATCACCGC CGGAACCCTT GAAGAGCGGA TCGATGAGAT GATCGATAAA AAGAGAGCCG TTTCGGGTTC GGTTCTCGGC ACAGGAGAGC AGTGGCTGAC AGAGCTCTCG AACAGTGATT TGAAAAAACT CATTATGCTT GGAAAGGAAG CAACCGGAGA ATAA
|
Protein sequence | MIALHCSVLD GIPMLWSEGR VPGDIKELRR ALKAIGVTFT IRKPMTREVV AWLPCRGEER VPSSPLIGDE PDKRRKSTLK PFTITALPLE ISLSLALFAC ARGGNIPGTG AFFGTSFLWT KHLFEAAVHL VSSQMFLPSL IRSGQQREGR WVPYPDNAAS LHLNLLLETM PPVCRCLSSD GNSLPKSSRK EVQEQLLFLT VDSLVRFLSA TTVRNNGKPA TVHDAWMHAI TGTDPAMRWE NKSEVEVFAH ELEKWRRPLD VFARSPFSFC FRLAEPEQNG RKKDVWQLVF LLQLKSDQSL LLEVGALWDP ESSASLQVKS YGGECREFML TALGQAAALY PELSAGLKLK KPGALKLDTD GAFRFLAGYA ELLQRAGFVV MLPSWWIGRG PVNRMGIKVN VKAPAMQSSG SKSGLDTLLS CDYAASLGND EFDLDELRRL AELKMPLVRV RGQWRQIEQR ELADALRFLE KRQSGELSVR DILAAAFGAG PKETALVHRT VDADGWMKDL LDKLKGYTRF ELLSQPDRFE GKLREYQVRG FSWLAFLRTW GLGACLADDM GLGKTIQTLA LLQQERNLGE KRPVLLICPT SVVNNWRKEA EQFTPDLAVL VHHGSDRLKT AAFRRAAAKS ALVISSYGLL LRDIASLSKQ QWAGVILDEA QNIKNPETKQ AKAARTLQSD YRIALTGTPV ENHVGDLWAL MDFLNPGFLG GQAFFKEHFY YPIQWNGDID ASERLKAITA PFILRRLKTD TSIISDLPDK IEMKQYCTLT REQASLYKAV IDELQEKIET AEGIDRRGLV LALLVKLKQV CNHPVQFLGD NSSVEHRSGK LQRLTELLSE IRECGQRTLV FTQFMEMGKI LQRYLQELFG EEVFFLHGSL SRKKRDAMID AFQQGEHAPH IFILSLKAGG SCLNLTNANH VVHYDRWWNP AVENQATDRA FRIGQKRNVE VHKFITAGTL EERIDEMIDK KRAVSGSVLG TGEQWLTELS NSDLKKLIML GKEATGE
|
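
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that display formatting and compute the GC fraction of a nucleotide string. The helper names are hypothetical, and the short input is only the first three blocks of the gene sequence, so its result is not expected to match the 52% reported for the whole gene.

```python
def clean_sequence(text):
    """Remove all whitespace (block spacing, line breaks) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three display blocks of the gene sequence from the record above.
prefix = "ATGATTGCCT TGCATTGTTC GGTTCTGGAC"
print(len(clean_sequence(prefix)))  # 30
print(round(gc_fraction(prefix), 3))
```

Passing the full gene sequence, with both display lines joined, to `gc_fraction` would give a value to compare against the GC content field.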