Gene Cphamn1_0749 details


Gene Information

Locus tag: Cphamn1_0749
Symbol:
ID: 6374414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 794653
End bp: 797631
Gene Length: 2979 bp
Protein Length: 992 aa
Translation table: 11
GC content: 59%
IMG OID: 642683257
Product: type III restriction protein res subunit
Protein accession: YP_001959183
Protein GI: 189499713
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
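
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(794653, 797631)
print(length)                     # 2979
print(protein_length_aa(length))  # 992
```

Both values match the Gene Length and Protein Length fields reported for this record.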


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.186322
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGA TCACCTCCGA ATACGCCTTT GAGGAACACA TGGAAGAAGT CCTGCTGAAG 
TCGCATGGCT ATTTCCCCGC CCAGCAGGTG GACTACGACA AGGCCCTGTG CCTGCGGCCC
GAGACGGTCA TCTCCTTCAT CCGCGCCACG CAGACCAAGA AGTGGGCCGA CTACTGCGAG
TTGATCGGCG ACAAGGGGCA GGCGACCCGC AACCTGCTCA AGCGCATCAA GGAGGTGGTG
GACAAGGAAG GCACGCTGCA CGCCCTGCGC AAAGGCTTCG ACATCCACGG CTCTGGGCAT
TTTGACCTGT GCTACTTCGA GCCCACCAAC CCGGTGGCGG AGGAGAGTCG GCGGCTGTAT
CAGGAAAACC TCCTGCACGT CCAGCGGCAG GTGAAGTTCT CGGAAAGCGA CGAGAAGTCG
CTCGACATGG GCATTTTCCT CAACGGCCTG CCCATTTTCA CCATCGAGCT GAAGAACCAG
ATCTCGGGCC AGAACGTCGG CCACGCGATG AAGCAATACA AGAACACCCG CGATCCCAAG
GAGCCGCTGT TCCGCTTCAA GCGCTGCCTC GCCCACTTCG CGGTGGATAA CGATCTGGTC
TATGTGGCCA CCGAACTCGC CGGAGCCAAG ACCCAGTTCC TGCCCTTCAA CCAGGGGAAC
GACGGCGGCG CAGGCAACCC GCCCTGCAAG ATCGGCTATG CCACCAGCTA CCTGTGGCAG
GAGGTCTGGA AGAAGCCGCG CATCCTCGAC CTGATCCAGC GTTTCATCCG CGTGATGGAT
GTGCTGGATG ACAAGGGCAA GAAGACCGAC AAACAGCGGC AGATTTTCCC GCGCTTTCAC
CAGCTCACCT GCGTGCGCGA ACTCGGGGCC GATGCCCAGC AGCACGGAGC GGGAAGGCGT
TACCTGAACC AGCACAGCGC GGGCAGCGGC AAGACCAACT GCATTGCATG GCTGGCCAAC
AGCCTTGCCA CCCTGCACAC CAAGGAGGGC CAGCCGGTCT TCAGTTCGGT GATCGTGATT
TCCGACCGCC GGGTGATCGA CCGTCAGCTT CAGCGCACGC TTACGCAAGT GATCGAGACC
CCCGGCATGT TGGTCAACAT CGCCGCCGAT GACGGCATGA CCTCCAAAGA TCTCAAGCGG
GCGCTGGAGG ACGGCAAGCG CATCATCGTC ACTACGCTGC AAAAGTTTGG CGTGATCATG
GACAGCATGG GCGATCTGCC GGGCGAACGC TTCGCGGTGA TCGTGGATGA GGCGCACTCG
TCGCAGGCGG GGCAAAGCGC CCAGGCGGTG CAGAAGGTGC TCTCCTATTC CTCCGAAGAC
GAGCAGAAGG ACGAAGAGGA GAAGACCACC GAAGACCGGA TTCTGGAAGA GCTGAAGACA
CGCGGCCCGC AAAAGAACGT GAGTTATTTC GCTTTCACCG CCACGCCCAA GCCGGAGACC
TTGCAGCAGT TCGGCACCAA GCATAAGGAC GGCACTTTCA AGCCATTCAG TCTCTACACC
ATGAGGCAGG CGATTGAGGA AAAGTTCATC CTCGATGTGC TCAAGAACTA CACCACCTAC
GACCAATACT GGGCCTTGCT CAAGAAGGTG AAGGACGATC CGGAGTTCGA GGAGGTAAAG
GCCAAGTCGC TTCTCAAGCA GTTCGTCAGC CGCCATGAGC GCACTATCGC CAAGAAGGTC
GCCATCATCG TGGCGCATTT CCAAGCATCG GTCAGCGACA AGCTGGGCGG CAAAGCCAAG
GCGATGATCG TCACCTCCTC CCGCCTGCAC GCCGTGCGCT ACAAGCTGGC TGTGGATGCC
TGTCTAAAAA AGAAGGGCAT TCCGTTCAAG GCGCTGGTGG CTTTCACCGA CGTGGTGAAG
GATTCCAAGG ACGGCAAGGA ATACACCGAA GCCAACATGA ACGGCTTCCC CGAGGCCGCG
ACGGCGGATC GCTTCGAGGC GGATGAATAC CGCTTCCTGA TCGTGGCGAG CAAGTTCCAG
ACTGGCTTCG ACCAACCCAA ACTGCTGGCG ATGTATGTGG ACAAGAAGCT CAGCGGCGTG
GCCTGTGTTC AAACGCTCTC GCGCCTCAAC CGCACCATGG TGGGCAAAGA AGAGACCTTC
GTGCTGGACT TCGAGAACAC CGCCGAGGAC ATCGAGAAAG GCTTCCAGCC ATTCTACGAC
CGCATCACGC TGTCCAAGGA AACCGACCCC AATCAACTCT ACAACATCCG CACCGACCTC
GATAAATTCG GGATTCACAC AGAGGCGGAT TTGAAGGCAT TCGCCGACCA ATGGTTCGCC
CCGAAGCCCA AAATCGAAAA GCTCCACGCG CTCACCGATC CTGTGGCCGA GAAGTGGAAG
AAGGAAGGCG AGGAAGAGCA GGTGGACTAC AAATCCAAGG CGCGGGATTT CGTGAAGCTT
TACGCCTTCA TGTCGCACCT CGTCCCGCTG AAGGATACCG GGCTGGCGAA GCTCGACGTG
TTCCTGCGCT TCCTGCTCCC CAAGCTGCCC GCCAAGAAAG GCGAGACCCC GCTGGAAGTG
CTTGGAATGG TGGATATGGA AAAGCTCGCC GTGCGCAAGA AGGACAAAAA AGACATCGGC
CTGAAACGCG GCGAGGAAAA GGTGGACCCA CTCAACTACG GCGGCGGCGC TACCCTCTCG
GAAGAGGAGC GGGAGCAGCT ATCCAAGATC ATCGAAGACC TCAACACCCG CTTCAACACC
TCGTTCACCG AAGACGAGAT CATGGTGATC AAGCAGCTCG AAAAGAAGAT CGGCGAGGAC
GAGGCCCTCC AACAACAGCT CAAGAACGGT TCGCCCCACG CGGTCGCAGC AACCTTTGAG
CAGGTGGCGA AAGATGCCTT CGAGGATTTG GTGAACGACA ACTTCAAGTT CTACAAAAAG
GTGAGCGAGG ACGACGAAGT CTCGAAGGAG TTTTTTGCTC GCCTGTTTGA GTGGTATGTC
GAGGGGCGCA AGAAGACAAC GCCGAAGAAA GAAAAGTAG
 
Protein sequence
MAKITSEYAF EEHMEEVLLK SHGYFPAQQV DYDKALCLRP ETVISFIRAT QTKKWADYCE 
LIGDKGQATR NLLKRIKEVV DKEGTLHALR KGFDIHGSGH FDLCYFEPTN PVAEESRRLY
QENLLHVQRQ VKFSESDEKS LDMGIFLNGL PIFTIELKNQ ISGQNVGHAM KQYKNTRDPK
EPLFRFKRCL AHFAVDNDLV YVATELAGAK TQFLPFNQGN DGGAGNPPCK IGYATSYLWQ
EVWKKPRILD LIQRFIRVMD VLDDKGKKTD KQRQIFPRFH QLTCVRELGA DAQQHGAGRR
YLNQHSAGSG KTNCIAWLAN SLATLHTKEG QPVFSSVIVI SDRRVIDRQL QRTLTQVIET
PGMLVNIAAD DGMTSKDLKR ALEDGKRIIV TTLQKFGVIM DSMGDLPGER FAVIVDEAHS
SQAGQSAQAV QKVLSYSSED EQKDEEEKTT EDRILEELKT RGPQKNVSYF AFTATPKPET
LQQFGTKHKD GTFKPFSLYT MRQAIEEKFI LDVLKNYTTY DQYWALLKKV KDDPEFEEVK
AKSLLKQFVS RHERTIAKKV AIIVAHFQAS VSDKLGGKAK AMIVTSSRLH AVRYKLAVDA
CLKKKGIPFK ALVAFTDVVK DSKDGKEYTE ANMNGFPEAA TADRFEADEY RFLIVASKFQ
TGFDQPKLLA MYVDKKLSGV ACVQTLSRLN RTMVGKEETF VLDFENTAED IEKGFQPFYD
RITLSKETDP NQLYNIRTDL DKFGIHTEAD LKAFADQWFA PKPKIEKLHA LTDPVAEKWK
KEGEEEQVDY KSKARDFVKL YAFMSHLVPL KDTGLAKLDV FLRFLLPKLP AKKGETPLEV
LGMVDMEKLA VRKKDKKDIG LKRGEEKVDP LNYGGGATLS EEEREQLSKI IEDLNTRFNT
SFTEDEIMVI KQLEKKIGED EALQQQLKNG SPHAVAATFE QVAKDAFEDL VNDNFKFYKK
VSEDDEVSKE FFARLFEWYV EGRKKTTPKK EK