Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2421 |
Symbol | |
ID | 6376116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2579584 |
End bp | 2582703 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642684899 |
Product | type III restriction protein res subunit |
Protein accession | YP_001960797 |
Protein GI | 189501327 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACCG ACAATCCCAT ACTCAACAGT CCCTACGAAG AACCCCAGCG GCATTATGCA ACAGATGCCG ACGGTTCCTT GAATTACAGC GATATCAGGG ATGGGCGTCG GATATTCGTA CCGGAATTGC AGCCTATCCC AGTTAAGCAG CCCCAAGGAT CATTGCTTGA GATCAACGAT TTCGCGGCAG AATTTGACAG TCATCTGGTC AATCTGCTGC GCTGTGAGAT TGCGCTTTGG AGAGTTGCGG AATATCCCGG AACAACAAGA GTTACAAAAG AACTCCTGAT ATTCTGGTTC AACAACCCTG GTCGTCATGC TGTCCGCAAG CTATTTTTCG CTCAGCGTGA AGCAGTAGAG ACTGCTGTCT GGCTGAACGA GGTGGCGGAA AAAAGCAACT CTGGTCAGAA CATCATGAGC ATTCTGCAGA CCGCTCATCG TTCTGTCAGT GCTGAAAAGG AACAGCAGTT GCCACGCCTC GCATTCAAGA TGGCGACAGG AACCGGAAAG ACCGTTGTCA TGGGGATGCT GATGCTCTAC CATCTCTTCA ACAGGCGCGA ATACCGACAG GATACACGGT TTGCTGATTA TTTCCTGATT ATCACCCCAG GCATTACCAT TCGTGAGCGT CTTGGCGTGT TGTTCGTTGA TAAGCACAGT GCAAGTCGTC AGGAGCGCAC CGATTATTAC GCACTTCGGG ATCTTGTACC CCGGCAATTC GAGCTCGCTC TCGATGGGCT CAATGCACGC CTGGTGATCA CGAACTACCA TGCCCTGGAG CCTAAAACCC TTCAGGGCAA CAAGAAAAGC CCGTTCGACG GCAAGCTCGA TGCCGCGGGA AAGAAGCAGG AAGCCAAAGA GGATTTTGGC AGGCTGATCA ATCGCTTGCT CGGAAGCTTC AGGAAAGGGA GCCGCCTGCT CGTGCTCAAC GACGAGGCCC ACCACTGTTA CCTGCCAAAA TCGTCGGGCA GGACAAAGGA TAACGAAGAA TCGGACGAGA ACGCAAAAGC AGCTATCTGG TTTTCCGGAC TCGTGGAAAT AGCCAAACGC TTCAAGCTGC AACAGGTGTA TGACTTGTCG GCCACCCCCT ACTACCTGCA GGGATCCGGT TACAAGCCCT ACACGCTGTT TCCCTGGGTG GTAAGCGACT TCGGGTTGAT CGAAGCTATC GAAGCGGGGC TGGTGAAAAT CCCGTTCATG CCGCAAAGCG ACAATACACA GGAGCTGGAT ATGCCGGTAT TGCGCAACCT GTATGAGCAT GTGAGCGATG AACTCCCCAA AAAAGGGCGC AAGAAAAAGA AATCCGAGGC GAAAAAAGAG GGCGTGTCGA TCACCGAAGA GCCGCCTGTT CTTCCGAAAC TGGTCAAGGG AGCGCTCGAC CAGTTCTACA ATCACTACCG GGAGTATTGC GACGGCCTGC GCCAGCAGTT CGAGGAAACG GCCGGATTGT TCACCTCTCC GCCGGTCTTT ATCGTTGTCT GCAACAATAC GTCAGTCTCC AAGGAGGTTT ACAAGTTCAT CGCAGGGTAT GAATATGAAC GGACGGGCAA GAATGGTAAC ACGGTCCGCG AAATTGTCGA TGGACACTAT TCGCAGTTCT CCAACTACGA TGCATCGACC AGGCAACCCA GGCACCGTCC GCCGACGCTC CTTATCGACA GCGACGCGCT TGAAAACTCC GACCAGATCA ACGACGAGTT CAAAAAAATC TTCGCATCCG AAATTGCAGA GTTCAAACGT GATTACGCGC GCCTGAAAGG CCAGGGTGCT GCGGAACAGA TCACCGATGC CGAGATTCTC CGGGAAGTGG TCAACACCGT CGGGCAGCCG GGCAAGCTGG GCGCCCACAT CCGCTGTGTT GTCTCGGTCT CGATGCTGAC CGAAGGGTGG GACGCCAACA CGGTTACCCA TATCATGGGA CTTCGCAAGT TCGGCTCCCA GCTTCTCTGC GAACAGGTTG CCGGCCGGGC TCTGCGGAGG ATGAATTACT ACCTGCAGAC CTACAGGAAA GACACCGGCG ACATTGTTCC CGAAACGGAA CGTCACCGCT TCAAGCAGGA AAACCTGGTC GAGAAGTTTC CGCCTGAATA CGCGCACATC ATCGGGGTGC CGTTCAGCAT GTTCAAATCC GGATCGACCA CCCTCACTCC TCCTCCGGAC TACACCCATG TTACGGCTCT GCCTGAACGC CATCAGGAGC TTGAAATCAC GTTCCCCAAC GTCGTCGGCT ACCGGACGGA ATATCTCGAC AAAGGGATCG TTCACGATTT CAGCGGTATC GAGAACTATG AACTGGATTT TTCGAAGTTT CCTACCGAGA TCGTGATGGC ATGTCCGTTC TCCCCGCATC AGGAAACCAT GCAGGTGACA TCCGTTCTGG AGAGACGGGA CCAGGAACTG CTCTACCTGA TCACGAAGGA GCTGATCCGC TACCATTTTG CCGACGACGA CCAGAATCCT CGTTTTCAGC TCTTCGGCGA TCTGAAAAAT ATCGTCGAGG AGTGGTACGA CACAAAGATT GTGCTTCTGA ACCAGTCGGA TGAACGATAC CGGCGACTGC TCTACTTCGA GAACGGCAAA ACCATCGCCG ACCATATTGC ACGGGGTATC AACCCGCACA TCAACACGGA AGAATATATC CGGCCGGTCT TCAACTACTA CAATCGCTTT GGTAGTACGA AATACGTCAG TGGCAATACC ACGAAAGAAA CCTGGCCGAC GTCGAAAAGC CATGTCAATG CAGTTGTCAT GGACAGTGAC TGGGAGGCCA TTGCAGCCAA GACACTGGAG GAGATTCCCG AGGTCGTTTC CTATGTCAAG AACCAGTTTC TCGGTTTCAC GATCCCGTAT GTGAAGGATG GCAAGGACAA GCTCTACTAT CCTGATTTCC TTGTCCGTCA CGTAACTCCA ACCAGAGAAA CCGCCAACCT GATCATCGAG ATCAGCGGCA TGAGCAAGGA CAAGGCCGAA AAGAAATGGT TCGTGCACAA CCGCTGGCTG CCGGCCGTGA ATGCCGTGCA GGAAAAATAC GGACTCGGCC GCTGGCACTT CATCGAGATC GCCAACGATA TCCGCGACAT CAGGACCCAG TTGGCGGAAA ATATTAAAAT CAACCTATAA
|
Protein sequence | MTTDNPILNS PYEEPQRHYA TDADGSLNYS DIRDGRRIFV PELQPIPVKQ PQGSLLEIND FAAEFDSHLV NLLRCEIALW RVAEYPGTTR VTKELLIFWF NNPGRHAVRK LFFAQREAVE TAVWLNEVAE KSNSGQNIMS ILQTAHRSVS AEKEQQLPRL AFKMATGTGK TVVMGMLMLY HLFNRREYRQ DTRFADYFLI ITPGITIRER LGVLFVDKHS ASRQERTDYY ALRDLVPRQF ELALDGLNAR LVITNYHALE PKTLQGNKKS PFDGKLDAAG KKQEAKEDFG RLINRLLGSF RKGSRLLVLN DEAHHCYLPK SSGRTKDNEE SDENAKAAIW FSGLVEIAKR FKLQQVYDLS ATPYYLQGSG YKPYTLFPWV VSDFGLIEAI EAGLVKIPFM PQSDNTQELD MPVLRNLYEH VSDELPKKGR KKKKSEAKKE GVSITEEPPV LPKLVKGALD QFYNHYREYC DGLRQQFEET AGLFTSPPVF IVVCNNTSVS KEVYKFIAGY EYERTGKNGN TVREIVDGHY SQFSNYDAST RQPRHRPPTL LIDSDALENS DQINDEFKKI FASEIAEFKR DYARLKGQGA AEQITDAEIL REVVNTVGQP GKLGAHIRCV VSVSMLTEGW DANTVTHIMG LRKFGSQLLC EQVAGRALRR MNYYLQTYRK DTGDIVPETE RHRFKQENLV EKFPPEYAHI IGVPFSMFKS GSTTLTPPPD YTHVTALPER HQELEITFPN VVGYRTEYLD KGIVHDFSGI ENYELDFSKF PTEIVMACPF SPHQETMQVT SVLERRDQEL LYLITKELIR YHFADDDQNP RFQLFGDLKN IVEEWYDTKI VLLNQSDERY RRLLYFENGK TIADHIARGI NPHINTEEYI RPVFNYYNRF GSTKYVSGNT TKETWPTSKS HVNAVVMDSD WEAIAAKTLE EIPEVVSYVK NQFLGFTIPY VKDGKDKLYY PDFLVRHVTP TRETANLIIE ISGMSKDKAE KKWFVHNRWL PAVNAVQEKY GLGRWHFIEI ANDIRDIRTQ LAENIKINL
|
| |