Gene Cphamn1_2548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2548 
Symbol 
ID6376246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2720415 
End bp2721632 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content44% 
IMG OID642685026 
Productrestriction modification system DNA specificity domain 
Protein accessionYP_001960923 
Protein GI189501453 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACA GGTTCAAGGC TGTTTTGCTG GGCGATATCG TTATTCCTAA AGGAATACAA 
ACAGGGCCTT TTGGGAGTCA ATTGAAAGCT GAAGAATATA CGGAAGATGG CGTTCCAGTA
GTAATGCCAA AAGATATTTG CAGCGGGTAT TTGACAAGTT CATTTATTTC AAGAGTCTCA
CAATCCAAGG CAAATAAACT GAAGAAGCAT CAGATTAAGG AAGGGGATAT AATATTTCCC
AGACGGGGAG ACCTGCGTCG AATCGGTGTG GCAAGAAAAG ACAATACTGG CTGGATTTGT
GGGACGGGTT GCCTTAGGGC GCGATTGAAT AGTGTAGTTC ATTCTGACTT TTTGCACCAA
TATGTGCTCT TAGACTCAGT TGGCAAATGG CTGGAAAGAA ATGCTCTTGG CCAGACCATG
TTGAATCTTA GTACAGACAT TATTTCTAAC CTGCCTTTAA CTCTGCCCTT GCTTTCTGAG
CAAAAAGCTA TTGCCGATTT GCTCTCTGCC TGGGATGAGG CTATCGAGAA AGCCGAACGA
CTGATTCAGG AGAAGGAGAG GCGGTTTAGG TGGTTGCTTC GTGAGTTGAT CTCAGAACCA
CGGAATACAC GGAAAGATGC GGAATGGAAA AAGGTGAGAA TGGGATCATT TTTGACTGAG
AGTCGGATTC CCGATCGTGA AAATGATCCC AAGAAGCGAA TAAGTGTGAG GCTGCATTTG
AGGGGCGTTG AGGTTCGAGA ATATCGAGGA ACTGAGTCTA ATGGAGCAAC TGCATATTTT
ATCCGTAAAG CAGGTCAGTT TATCTACGGG AAACAAAATG TCTTTAGAGG GGCTGTTGGC
ATAGTACCCC TTGAATTAGA TGGCTATAGC TCAACTCAGG ACATACCGGC ATTCGACATA
GCTGATCATG TTGATAAGAG CTGGCTTCTA TTTCTCTTTT CATATACAAA CTTTTACAAA
AAATTAGAGC TATATGCGAG CGGGTCGGGA TCAAAGCGAC TTCATCCCAA AGAGCTCTTC
AGAATGAAAA TCACCTTACC AACATTCGGC GAACAGCAAC AAATTGCTGA GACATTATCC
TCAGCCCAGT ACGAAATCGA CCTGCTGAAG CAGCTCGCAG AAAAATACAA AACCCAGAAA
CGCGGCCTGA TGCAGAAGAT GCTCGCTGGC ACATGGCGGG TAAAACCGGA AATCGTTCAT
CAATACATGG AGGCGTAA
 
Protein sequence
MSDRFKAVLL GDIVIPKGIQ TGPFGSQLKA EEYTEDGVPV VMPKDICSGY LTSSFISRVS 
QSKANKLKKH QIKEGDIIFP RRGDLRRIGV ARKDNTGWIC GTGCLRARLN SVVHSDFLHQ
YVLLDSVGKW LERNALGQTM LNLSTDIISN LPLTLPLLSE QKAIADLLSA WDEAIEKAER
LIQEKERRFR WLLRELISEP RNTRKDAEWK KVRMGSFLTE SRIPDRENDP KKRISVRLHL
RGVEVREYRG TESNGATAYF IRKAGQFIYG KQNVFRGAVG IVPLELDGYS STQDIPAFDI
ADHVDKSWLL FLFSYTNFYK KLELYASGSG SKRLHPKELF RMKITLPTFG EQQQIAETLS
SAQYEIDLLK QLAEKYKTQK RGLMQKMLAG TWRVKPEIVH QYMEA