Gene Information |
Locus tag | Cpha266_0635 |
Symbol | |
ID | 4569789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 719415 |
End bp | 722333 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765233 |
Product | hypothetical protein |
Protein accession | YP_911114 |
Protein GI | 119356470 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
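The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Cpha266_0635.
length = gene_length_bp(719415, 722333)
print(length)                     # 2919
print(protein_length_aa(length))  # 972
```

Under these assumptions the computed values agree with the Gene Length (2919 bp) and Protein Length (972 aa) fields in the record.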
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATCAG ACTATCCAAT AAAATCACCC CGCAAGCTTA TTGAGGTTGC GCTTCCGCTG GATGCTATCA ATGCAGCTTG TGCCTATGAG AAAATGCCAG GGATAGGTGC TCATCCACGC GGAATCCATC TTTGGTGGGC AAGGCGACCT CTTGCAGCCG CACGAGCGGT TCTGTTTGCG CAACTTGTCA ACGACCCTGG TTACCAACAG GGTTGTGGTT TCAAATATGG CAAGAATAAA AAGGAAGCTG CTATTGAACG AAAGCGACTG TTCAAGATTA TCGAAGAGCT TGTATTGTGG GAAAATACCA CCAACGAGGA AGTTCTGGAA CGCGCCAGGG TTGAGATTCG CCGATCCTGG CGGGAGGTGT GCGAACTGAA TAAAGAGCAT CCGCAGGCTG CCGAGTTGTT CAACCCGGAA AAGATGCCTG CGTTTCATGA CCCTTTTGCT GGTGGTGGAG CTATTCCACT TGAAGCACAG CGATTGGGAC TCGAAGCCTA CGCCTCCGAT CTGAATCCAG TTGCAGTGCT GATTAACAAA GCAATGATTG AGATTCCGCC CAAGTTCGCT GGACGACCTC CGGTCGGACC AGAGATTGAG AACAAACAGG GCAAAAGTCT CGAACTTCCA AAAATTTGGC CGGGAGCTAC CGGACTGGCA GAGGATGTTC GCCGCTATGG GTCGTGGATG CGTGATGAAG CACAAAAACG TATTGGACAC CTTTATCCAC CTGTTGAAGT CACTGAGGAG ATGGCGAGGG AACGGTCTGA CCTCAAACCG CTTGTTGGCA AGAAACTTAC CGTCATTGCA TGGTTGTGGA CAAGAACTGT CAAAAGTCCG AATCCTGCTT TTACCCATGT AAACGTTCCA CTTGTTTCGT CCTATATATT GGCAAATAAA GATGGTAAAG AAGTATACGT AAAGCCAGTT ATCGAGGGTG ATAAGTATTA TTTTTTGATT AAAAATGGTA CTCCAACAAC TGAAGCTAAA GATGGGACAA AAGCAACTGT GCGAGGGGCA AACTTCAGAT GTATAGTTTC AGGCGCTGTA ATTGGAAGCG ACTATATAAA AACAGAAGGT AAAGCGGGAA GAATCGGCCA AAGACTCATG GCTATTGTTG CCGAGGGTAC CCGAGGCCGG GTTTATCTCC CCCCAACATT GGATGCGGAA ACTGTTGCAA AAGATGCTGC TCCAACGTGG CGTCCATCCG GTGATGTACC AGAGAGATTG ACAGGTGGGA CTTGTGTTCC TTATGGATTG AAAGAATGGG GAGATATTTT TACTCCTCGG CAGTTGGTCG CATTGACAAT ATTAAGCGAT TTACTTTCTG AAGTGCGTGA ACTTGTCAGA GAAGATGCTT TAGCTTCTGG CTTATCTGAC AGCGAAAAAG GACTTGGTGA TGGAGGCTCT GGTGTGATTG CTTATGCGGA AGCAGTTAGC GTGTATCTTG CTTTTGCCAT CAATCGCTGT GCGGATTTTT GTAATTCAGT TACACGATGG GTTCCAGGAA ATCAGAAGGT GATGAATCTT TTTGGGAAAC AAGCCATTCC TATGACATGG GATTATCCAG AAGCTGCTAT TCTTGCTGAT ACGGTCGGTG GCTTTGCTCC CGCTTCGAAA TACGTTGCTG ATTGCATTGG GAAGTTGTCT CCTGCCGCTG TCGGTTTTGC CTCTCAAGCT GATGCGCAAA CACAGAGTAT TTCCATGGCT AAAATAATTT CAACTGATCC ACCCTATTAT GACAACATTG CCTATGCTGA TCTTTCTGAT TTTTTTTACG TGTGGCTGAG GAAATCCTTG 
CGTCTATTTC TTCCAGGCTT GTTTTCCACA ATTACAGTCC CTAAGGTTGA AGAACTCGTA GCGGCTCCTT ATCGTCATGG CACAAAACAG AAAGCTGAAA CGTTTTTTCT TACGGGCATG ACTGAGGCCA TCCATAATCT TGCCGAACAA GCACATCCCG AGGGCCCGGT TACAATTTAC TACGCATTTA AGCAGTCGGA GACAAGTGGA CAAGATGGCA CATCTTCTCC AGGATGGGTA ACGTTTTTGT CGGCAGTGTT AAGTGCCGGT TTTGCTATTG TTGGTACATG GCCGTTACGC AGCGAACAGG AATTCCGGAT GATTGGCATG GGAGCAAATG CCCTTGCATC CAGCATCGTC CTTGTTTGCC GCAAACGTTC TGCTGACGCT CCCTCCGTTT CCCGCCGTGA GTTCATTCGC GAGTTGAACG GCGTATTGCC CGAGGCACTT GACGAAATGA CCAAAGGTTC CGGCGAGGAA CGTTCGCCCG TTGCTCCGGT GGATTTGTCG CAGGCTATCA TTGGCCCCGG CATGGCGGTT TTTTCCAAAT ACTCAGCAGT GCTGGAAGCA GACGGCACAC CGATGAGCGT CAGAACGGCT CTCCAGCTCA TCAACCGTTT TCTTGCCGAG GATGATTTCG ATCCAGATAC CCAGTTCTGC CTTCACTGGT TTGAGCAATA CGGATGGAAT GAAAACCTTT TCGGTGAAGC TGATGTATTA GCCCGCTCAA AATCGACAAG TGTTGACGCC ATGAAGGAGG CCGGTGTGCT CCAGAGTGGC AGCGGCAAAG TCCGTCTGCT CAAATGGGCG GAGTATCCCA GCGATTGGGA TCCGCGTACT GACAAGCGAA TGCCTGTATG GGAAGCACTG CACCAGCTTA TCCGCGCATT GAAGCAAGGA GGCGAATCTG CATCCGGAGC ACTTCTTGCT GCTCTCGGCG GCAAAGCTGA AGCGGTACGT CAACTCGCAT ACCGCCTCTA TACGCTCTGC GAACGACTCG GCCAGGCCGA AGATGCACGA GCCTATAACG AACTGATCAC AAGCTGGACA GGTATCGAAT CTGTTGCCAA CAGCATACCA AAACCGTCCG ATCCGCAACT GTCACTTTTT GATAACTGA
|
Protein sequence | MISDYPIKSP RKLIEVALPL DAINAACAYE KMPGIGAHPR GIHLWWARRP LAAARAVLFA QLVNDPGYQQ GCGFKYGKNK KEAAIERKRL FKIIEELVLW ENTTNEEVLE RARVEIRRSW REVCELNKEH PQAAELFNPE KMPAFHDPFA GGGAIPLEAQ RLGLEAYASD LNPVAVLINK AMIEIPPKFA GRPPVGPEIE NKQGKSLELP KIWPGATGLA EDVRRYGSWM RDEAQKRIGH LYPPVEVTEE MARERSDLKP LVGKKLTVIA WLWTRTVKSP NPAFTHVNVP LVSSYILANK DGKEVYVKPV IEGDKYYFLI KNGTPTTEAK DGTKATVRGA NFRCIVSGAV IGSDYIKTEG KAGRIGQRLM AIVAEGTRGR VYLPPTLDAE TVAKDAAPTW RPSGDVPERL TGGTCVPYGL KEWGDIFTPR QLVALTILSD LLSEVRELVR EDALASGLSD SEKGLGDGGS GVIAYAEAVS VYLAFAINRC ADFCNSVTRW VPGNQKVMNL FGKQAIPMTW DYPEAAILAD TVGGFAPASK YVADCIGKLS PAAVGFASQA DAQTQSISMA KIISTDPPYY DNIAYADLSD FFYVWLRKSL RLFLPGLFST ITVPKVEELV AAPYRHGTKQ KAETFFLTGM TEAIHNLAEQ AHPEGPVTIY YAFKQSETSG QDGTSSPGWV TFLSAVLSAG FAIVGTWPLR SEQEFRMIGM GANALASSIV LVCRKRSADA PSVSRREFIR ELNGVLPEAL DEMTKGSGEE RSPVAPVDLS QAIIGPGMAV FSKYSAVLEA DGTPMSVRTA LQLINRFLAE DDFDPDTQFC LHWFEQYGWN ENLFGEADVL ARSKSTSVDA MKEAGVLQSG SGKVRLLKWA EYPSDWDPRT DKRMPVWEAL HQLIRALKQG GESASGALLA ALGGKAEAVR QLAYRLYTLC ERLGQAEDAR AYNELITSWT GIESVANSIP KPSDPQLSLF DN
|