Gene Cphamn1_1149 details

Gene Information

Locus tag: Cphamn1_1149
Symbol:
ID: 6374824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1235518
End bp: 1238592
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 39%
IMG OID: 642683651
Product: type III restriction protein res subunit
Protein accession: YP_001959568
Protein GI: 189500098
COG category: [V] Defense mechanisms
COG ID: [COG3587] Restriction endonuclease
TIGRFAM ID:
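
The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 3075 bp) and that the protein length excludes the stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1235518
END_BP = 1238592
GENE_LENGTH_BP = 3075
PROTEIN_LENGTH_AA = 1024


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values above, 1238592 - 1235518 + 1 = 3075 bp, and 3075 / 3 - 1 = 1024 aa, so the reported lengths agree with each other.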


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.511254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0779427
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGA TATTCAAACA ACAACCCTAC CAAACGGATG CCACTATGGC GGTTGTCGGC 
TGCTTTGAAG GGCAAAGCAA AGGCTTTAGA AAAGAAGTGG TTGGCAGAGA AACCCTCGAC
CACGGGCTGT TTGGCACAGA AGTCAAAGTT GAAGAGATCT TCTCCAATAA AAAACTGGAA
ATTACCGAAG TCGATATATT AAAAAATGTG CAAGCACTTC AAAAAGAACA GGGCTTAAAA
ACCATCAGCA AACTGGATGG TCTCAACTTT ACCACCGAAA TGGAAACCGG AACCGGTAAA
ACCTATGTCT ATACCAAAAC CATGTATGAA CTCAATAAAC ACTACGGCTG GAATAAGTTT
ATTATCATGG TGCCATCAGT GGCGATTCGT GAGGGTGTAC ATAAATCGTT AGAAATTACC
GCAGACCATT TTCAGGAGAT TTACGGTAAA AAAATTCGTT TTTCCATCTA CAACACCCAG
AACAAATCTA ATCTGATAAA CATCAAGAGC TTTGCCAACA CTTCCAATAT CGAAGTGCTT
ATCATGAACT ATCAGGCATT TGCCACCACC AGCCAGGAAT CCAGAAAGAT TTATCAAAAA
CTCGATACAC TGCAAAGTGA ACGGCCTATT GATATTATTA AACGAGCCAG ACCCATTCTG
ATTATTGATG AACCACAACG AATGGGTATT GAAGAAGGTA AAGTTTTCTC TGGGAATAAG
CCATCAAAAG TAGTCGAAGC AATATATCGG AATAATGAAT TTAATCACCT TTGTACTCTA
TTGTATTCAG CGACCCATAA AAAAGATTTC AATAAAATTT ATCGCCTCGA TGCAATTGAT
GCCTACAACC AGAAACTGGT GAAAAAAATA AGTGTCAAAG GGATTGAGGT TGTCGGAAAC
AGCGGAACCA ACAGCTATTT GTTTTTAGAT AAAATCCAAA TCAGCACGAG CCAATTTCCA
GTTGCCTATG TTGAGTTGGA AGTTAAACAG GCTAACGGTA TTCAGAAAAA AATCAGACAG
ATAAAAGAGA AAGATGACCT CTATGTATTC TCCAATGAAT TGAAACAATA CAAAGGCTTT
GTGGTAAAGT CGATTAACGG TCTTACCAAT ACCGTTTCAT TCACCAACGG AATTACCCTC
TCTGTAGGGC AAACCGCAGG AGACGTGGAC GAAGAGCATG TTCGCCGTAT ACAGATTCGG
GAAACGATTA AATCCCATAT TGAAAAAGAA CGAATCATGT TTCAAAAGGG AATTAAAGTA
CTGTCCCTTT TCTTTATCGA TGAAGTGGTA AAGTACCGTG ATTATTCAAG TCCCGACCAG
AAAGGCATTT ATGCCAAAGT TTTTGAGGAT GAATACCGCC AGGCAATCAG CGAGCTAAGT
CTTTTTGAAC CGGACTACAA CGACTATCTT AGTAAATTTA CTGTGGAAGA TATTCACAAA
GGTTACTTCT CCGTTGATAA AAAAGGACTG TTTATAGATT CCAAAGAAAA ACGCGGCGAA
AGCGGTAGTG ATGATGTGAG TGCCTATGAC CTCATTATGA AAAAGAAGGA AATACTTCTT
GATCTAAAGG AATCTACCCG TTTTATATTT TCTCATTCTG CGCTTCGTGA AGGTTGGGAT
AACCCCAATG TATTTCAGAT TTGCACTCTC AAGCACAGCC AATCAGAAAT CAGCAAGCGA
CAGGAGATAG GACGTGGGCT TCGTATTTGT GTCAATTCAA AAGGCGAGCG CATGGATGCA
TCGGTACTTG ATAGTGATTT TTTTGATGTG AATAAACTCA CGGTAGTTGC CAGTGAATCC
TACGATTCCT TTGCCAAAGC CTTGCAGAAT GAAATTGTTG AATCACTTTC AGAGCGTCCA
GTTACCCTTA CGATTGAGGT ATTAAACAAC CGTGTCATTC ATAACGAAAA AGGCGAAAAG
TTTGTTTTTG ATAGCCAGTC CTCTATGGAT TTGATTTTCG ATATGAAAAC CAAGGGCTAT
CTGGACGCAA ATTACCATAT TACCGAAGCC CTGATTACAG ATGTGGAAAA TAAAACTTAT
GCTCTCCCGG AAAAGTTACA AGGTTTTGAA TCTTGCGTTG CTGAACTTAT GACCGGTATC
TACACAACGG CAAACTTTAA AGCAGCAGAA AATGAAAATG CCAATAATAT CAATGAGGCT
ATTCTAAAAC CTAACGATAA CTTTGCCAAA AAAGAGTTTC AGGATTTATG GAAGAAGATT
AAAGTAAAAA CGGTCTATGA AGTGGATTTT GAAAGTGCAG AGCTTGTTGA TCTCTGCGTT
AAGGCAATTG ACACCAATCT CACCGTTAAA AAAATATTGA TAAATATTAC TTCCGGTGAA
CAGCAAGACA AGATTGACGA GGCGACACTC AAATCCGGCG AAAGCATGAA GAAAGAAAAA
AGTGTTACCG AAAGAGCCGA GTCTTTGCTG GGTTCACTGA AGTACGATCT GGTAGCTGAA
ATTGCCAAAG AGACAAATCT TACCCGCAAA ACGGTTGTTA AAATATTGCA GGCATTACGG
CAGGATACCT TCCATTACTT CAGGGTAAAT CCAGAAAGCT TTATTCAGGG TGTAACCAAT
ATCATCAATA GCGAAAAAGC TGCAACCCTT ATTAACAACA TTGTCTACTC AAAAACAGAC
AAGACATACG ATGACAGTAT TTTTACCATC AATAACTTTA AGGGATCACT CACAAAGAAT
ATTCTGGAAG TCAAAAAGCA TGTTCATAAC TATGTAAAGA CCGATTCTGA TATTGAAAGA
AGATTTGCTA CTGATCTTGA ATGTGAAGAA GTGCTTGTCT ATGCAAAACT ACCAGGTGGC
CCGAATGGTT TTAAAATACC AACACCACTG GGAAACTATA ATCCTGACTG GGCTATCGTT
TTTAATACTG ACAAATTCAA GTATGTCTAT TTTATTGCAG AAACCAAGGG AAGCATGGAA
ACCTTGCAAC TGAAAGAAAT AGAGCAGAAA AAAATCAGTT ACGCCAAAAA GCATTTTGAG
GCATTAGGAC ATGCAGATAT TAAATACGAT GTGATTGATT CGTATCAGGC ATTGAGAGAC
AAAATAATGA ATTAA
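
The gene sequence is printed in blocks of ten bases, so it must be joined before its length or GC content can be recomputed. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGAAGCTGA TATTCAAACA ACAACCCTAC CAAACGGATG CCACTATGGC GGTTGTCGGC"

if __name__ == "__main__":
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full sequence in place of `first_line` should give a length of 3075 if the listing is complete, and the resulting GC percentage can be compared with the rounded 39% in the record.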
 
Protein sequence
MKLIFKQQPY QTDATMAVVG CFEGQSKGFR KEVVGRETLD HGLFGTEVKV EEIFSNKKLE 
ITEVDILKNV QALQKEQGLK TISKLDGLNF TTEMETGTGK TYVYTKTMYE LNKHYGWNKF
IIMVPSVAIR EGVHKSLEIT ADHFQEIYGK KIRFSIYNTQ NKSNLINIKS FANTSNIEVL
IMNYQAFATT SQESRKIYQK LDTLQSERPI DIIKRARPIL IIDEPQRMGI EEGKVFSGNK
PSKVVEAIYR NNEFNHLCTL LYSATHKKDF NKIYRLDAID AYNQKLVKKI SVKGIEVVGN
SGTNSYLFLD KIQISTSQFP VAYVELEVKQ ANGIQKKIRQ IKEKDDLYVF SNELKQYKGF
VVKSINGLTN TVSFTNGITL SVGQTAGDVD EEHVRRIQIR ETIKSHIEKE RIMFQKGIKV
LSLFFIDEVV KYRDYSSPDQ KGIYAKVFED EYRQAISELS LFEPDYNDYL SKFTVEDIHK
GYFSVDKKGL FIDSKEKRGE SGSDDVSAYD LIMKKKEILL DLKESTRFIF SHSALREGWD
NPNVFQICTL KHSQSEISKR QEIGRGLRIC VNSKGERMDA SVLDSDFFDV NKLTVVASES
YDSFAKALQN EIVESLSERP VTLTIEVLNN RVIHNEKGEK FVFDSQSSMD LIFDMKTKGY
LDANYHITEA LITDVENKTY ALPEKLQGFE SCVAELMTGI YTTANFKAAE NENANNINEA
ILKPNDNFAK KEFQDLWKKI KVKTVYEVDF ESAELVDLCV KAIDTNLTVK KILINITSGE
QQDKIDEATL KSGESMKKEK SVTERAESLL GSLKYDLVAE IAKETNLTRK TVVKILQALR
QDTFHYFRVN PESFIQGVTN IINSEKAATL INNIVYSKTD KTYDDSIFTI NNFKGSLTKN
ILEVKKHVHN YVKTDSDIER RFATDLECEE VLVYAKLPGG PNGFKIPTPL GNYNPDWAIV
FNTDKFKYVY FIAETKGSME TLQLKEIEQK KISYAKKHFE ALGHADIKYD VIDSYQALRD
KIMN
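
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino-acid string, whose sense-codon assignments translation table 11 shares. It does not handle the alternative start codons that table 11 allows, which is acceptable here only because this gene starts with ATG. The name `translate_cds` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 15 bases of the gene sequence.
    print(translate_cds("ATGAAGCTGA TATTCAAACA ACAACCCTAC"))  # MKLIFKQQPY
    print(translate_cds("AAAATAATGA ATTAA"))                  # KIMN
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above.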