Gene Information |
Locus tag | Cphamn1_2496 |
Symbol | |
ID | 6376192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2664571 |
End bp | 2667459 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642684973 |
Product | protein of unknown function DUF1156 |
Protein accession | YP_001960871 |
Protein GI | 189501401 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| |
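The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; both assumptions are consistent with the values listed but are not stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2664571
END_BP = 2667459
STOP_CODON_LENGTH = 3  # assumption: the CDS ends with one stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_LENGTH) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2889, matching "Gene Length"
print(protein_length(length_bp))  # 962, matching "Protein Length"
```

Because the gene is recorded as unspliced (Is gene spliced: No), the whole span between Start bp and End bp can be treated as one contiguous coding region in this check.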
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGCA AGAAAAAACT TATAGAAGTA GCCCTGCCGC TGGAAGTGAT CAATGTTGCC AGCGCGCGGG AAAAATCGAT ACGGCACGGC CATCCTTCGA CGCTGCATTT GTGGTGGGCG CGGCGGCCGC TGGCGGCGGC GCGAGCGGTG ATCTTTGCGC AGATGGTTGA TGACCCTTCG GCCCATCCTG ACCTGTTTCC CACTGAAAAG AAGCAGGAGA AAGAGCGGCA GCGGTTGTTC CGCATCATCG AGGATCTGGT GAAGTGGGAG AACACCACGA ACGAAACGGT ATTGCAGCAG GCGCGCGACG AGATTTGGCA GAGCTGGCGC TACACCTGCG CCGAGAATGC TGACCACCCG AGAGCTGCCG AAATCTTCGA CCGTTACAAG CTGCCGGATT TTCACGATCC CTTTGCTGGC GGCGGAGCCC TGCCGCTCGA AGCACAACGG CTCGGGATGG AGAGCTATGC AAGCGACCTG AATCCGGTGG CCGTACTCAT CAACAAAGCG ATGATCGAGA TTCCTCCGAA GTTTGCCGGA AAGCCGCCGC TCAACCCTGC GTGGTACAAC AAGTCCGAAA CGGAGAAGAT GGGGCGAGAG TGGAAAGGTG CGCAGGGGCT TGCCGGCGAT GTGCGGTATT ACGGCCAGTG GATGCGTGAC GAAGCTGAAA AGCGAATAGG CCATCTCTAT CCCAAAATCG AGGTTACGGA AGCGATGGCT GAAGAGCGGA CCGATCTGAA AAAATATGTC AGCAGAAAGC TCACGGTGAT TGCCTGGCTC TGGGCACGAA CGGTCAAAAG CCCCAACCCG GCCTTTGCCG ATGTCGATGT GCCGCTTGCC TCGACCTTCA TGCTTTCCAC CAAGGCAAGC AAGGAGGCGT ATGTTGAACC GGTCATCGAA GATGGCGGGT ACCGGTTCAC TGTAAAGATT GGTAAACCGA CAGATGTGGA TGCGGTAAAG CGAGGCACAT CAGCAGGGAA ACGTTCGGCT TTCCGCTGCC TTATGTCCAG CGCTCCTGTT TCCTATGACT ATATTCGTGA GGAAGGAAAA GCCGATAGAA TGGGGGTAAA GTTGATGGTG ATTGTCGCAG AAGGCGATCG GGGGCGAGTC TATCTTTCGC CGACGGAGGA GATGGCAGCG ACTGCGTTGC AAGCCACACC GGACTGGAAA CCTGACACCC CATTGCATGG AAAATGCCGC GTCAATGTCT CAAATTATGG TTTGGATGTT TATGGTGATC TCTTCACCCC GCGCCAGCTT GTGGCGCTGA CGACCTTCTC TGATCTGGTG CAGGAAGCGC GTGAACGGGT AAAGCAGGAT GCCATTCAGG CCGGTTTGCC CGATGATGGC AAGCCTCTTG CAGAACAGGG CGCTGGCGCG GCTGCCTATG CTGACGCTTT GGCGGTGTAT CTTGCGTTTG TCATAGATAA AACATCGGAT AGAGGCTCAA CGATATGTAG CTGGGATTCT TCGAGAGATA GCTTGAGAAA TACATTTGGT CGTCAGGCAA TACCAATGAC GTGGGACTTT GCTGAGTCGA ATGTTTTATC GGAATCCACT GGTGGTATCT CAAGCGGATT AGATCAAGTT TATCGCGTTT TGGTTGGATT GCCGCAAAAT ACTGTTGGGA AATCTTCTCA ACAAGACGCA CAAACGCAAT CTATTTCAAG AGATAAGGTT ATCTCAACGG ACCCACCCTA CTACGATAAT ATAGGTTATG CCGATTTGTC TGATTTTTTC TATGTCTGGA CACGTCGCTC ATTAAAGTTC GTTTTCCCTG ACCTTTTCTC CACGCTTGCC 
GTTCCAAAGG CTGAAGAGCT GGTGGCAACC CCCTATCGCC ACGGCAACCG CGAAAAAGCG GAAACATTCT TTCTCAATGG CATGACGCAG GCGATGCACC GGCTTGCAGA GCAGTCTCAC CCAGCCTTTC CGGTGACGAT TTACTATGCC TTCAAACAGT CGGAGACTGG AAATGATGAC GGTACGACCA ATACCGGTTG GGATACCTTT CTTGCTGCCG TGATCGAGGC CGGTTTTTCC ATCAGCGGAA CCTGGCCGAT GCGCACCGAA CTGAGCAATC GCATGATCGG TTCAGGTACT AACGCCCTTG CTTCAAGTAT CGTGCTTGTT TGTCGTAAGC GTCCGGAAAA CCCGCCAGTC GCCACGTTCC GCGAATTCGA TACCGCGCTC AAGTCTGAAT TGCCTCAGGC GCTTTCGCAA TTGCAGGCGG GCAATATCGC ACCGGTCGAT CTTGCGCAAG CGGCCATCGG CCCCGGCATG GCGGTCTATA CGCGCTATGC TAGCGTGCTC GATGCGGAAG GCAGGCCTTT AACGGTCCGT GCGGCCCTTG CCCGGATCAA CAAGGTGCTC GACGAAGCGT TGGCCGAGCA GGAGGGCGAT TTCGATGCCG ACAGCCGATG GGCACTGGCC TGGTTCGAGC AGATGGGCTT CAACGACGCT GAATTCGGTA CTGCCGACGT GCTGGCGCGG GCAAAGGCGA CCTCGGTTGG CGGTATTGTC GAGGCGGGTA TCGCTTTTTC CGGAAAGGGC AAGGTGCGTC TCTTCAGGCC GTCGGAACTT CCCTCCGATT GGGATCCATC CACCGACAAG CGATTGACCG TCTGGGAGAT GGTGCATCAG CTCATCCGCG TGCTGGAAGC GGAGGGTGAG CCTGCGGCGG CGGAACTGGT CGCAAAACTT GGCAGCGAGG CCGAAACGGC GCGTGAACTC TGTTATCGGC TCTATACCCT CTGCGAGCGC AAAAAGCGAC CTCAGGAGGC GATGGCCTAC AATGCCCTCG TGCAGAGCTG GCCTGAACTC ACCCGGCTTG CAGGCGAAAT GCCCTCCCTC CCGGCTTCCG GTACCTATAA CCTGTTTGAA AACGAGTAA
|
Protein sequence | MTCKKKLIEV ALPLEVINVA SAREKSIRHG HPSTLHLWWA RRPLAAARAV IFAQMVDDPS AHPDLFPTEK KQEKERQRLF RIIEDLVKWE NTTNETVLQQ ARDEIWQSWR YTCAENADHP RAAEIFDRYK LPDFHDPFAG GGALPLEAQR LGMESYASDL NPVAVLINKA MIEIPPKFAG KPPLNPAWYN KSETEKMGRE WKGAQGLAGD VRYYGQWMRD EAEKRIGHLY PKIEVTEAMA EERTDLKKYV SRKLTVIAWL WARTVKSPNP AFADVDVPLA STFMLSTKAS KEAYVEPVIE DGGYRFTVKI GKPTDVDAVK RGTSAGKRSA FRCLMSSAPV SYDYIREEGK ADRMGVKLMV IVAEGDRGRV YLSPTEEMAA TALQATPDWK PDTPLHGKCR VNVSNYGLDV YGDLFTPRQL VALTTFSDLV QEARERVKQD AIQAGLPDDG KPLAEQGAGA AAYADALAVY LAFVIDKTSD RGSTICSWDS SRDSLRNTFG RQAIPMTWDF AESNVLSEST GGISSGLDQV YRVLVGLPQN TVGKSSQQDA QTQSISRDKV ISTDPPYYDN IGYADLSDFF YVWTRRSLKF VFPDLFSTLA VPKAEELVAT PYRHGNREKA ETFFLNGMTQ AMHRLAEQSH PAFPVTIYYA FKQSETGNDD GTTNTGWDTF LAAVIEAGFS ISGTWPMRTE LSNRMIGSGT NALASSIVLV CRKRPENPPV ATFREFDTAL KSELPQALSQ LQAGNIAPVD LAQAAIGPGM AVYTRYASVL DAEGRPLTVR AALARINKVL DEALAEQEGD FDADSRWALA WFEQMGFNDA EFGTADVLAR AKATSVGGIV EAGIAFSGKG KVRLFRPSEL PSDWDPSTDK RLTVWEMVHQ LIRVLEAEGE PAAAELVAKL GSEAETAREL CYRLYTLCER KKRPQEAMAY NALVQSWPEL TRLAGEMPSL PASGTYNLFE NE
|
| |
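The relationship between the Gene sequence and Protein sequence fields can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). Only the first 30 and last 9 bases of the record are used as sample input; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_30_bases = "ATGACCTGCA AGAAAAAACT TATAGAAGTA"  # start of Gene sequence
last_9_bases = "AACGAGTAA"                           # end of Gene sequence

print(translate(first_30_bases))  # MTCKKKLIEV
print(translate(last_9_bases))    # NE
```

The two printed fragments match the first ten and last two residues of the Protein sequence field. Calling `gc_fraction` on the complete gene sequence would be the way to compare against the GC content field; the sample fragments above are too short to be representative.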