Gene Cphamn1_0334 details

Gene Information

Locus tag: Cphamn1_0334
Symbol:
ID: 6373994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 338924
End bp: 340216
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 49%
IMG OID: 642682853
Product: restriction modification system DNA specificity domain
Protein accession: YP_001958784
Protein GI: 189499314
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
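
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon (neither convention is stated in the record; the variable names are illustrative only):

```python
# Values copied from the record above.
start_bp = 338924
end_bp = 340216

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print(gene_length, "bp")     # gene length in base pairs
print(protein_length, "aa")  # protein length in amino acids
```

Under these assumptions the result agrees with the listed 1293 bp and 430 aa.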


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.284396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACAA AGGGGGGCGG ACTTTCCAGT CCGCGATTAA AAAACTCAAG CGGGTTGGAA 
AACCCGCCCT ACGTTGAAGG TGATTCTGGA GGAAGTCTTG ATATGAGGGA TTCTAATCTA
CTCATTGAAT CACTACCAGA TAGATGGAAA AACCACAAGT TTGGTGATTT ATGCGATCGA
GTGAAAAACT CATATCAACC AGTCGATGGG GGTGAAAAAC CATATATCGG CCTTGAGCAT
CTTGCCCAAG GATTCCCGGC GTTCATTGGT CGGGGAAAAG AATGTGAGGT AAAGTCCTCG
AAAACAGTAT TTAAATCTGG TGACATTCTC TTTGGGAAAC TTCGTCCATA TCTGCGTAAG
GGAGCTCAGG CGGACTTTGA TGGCATTTGT TCAACAGATA TTCTCGTGTT TCGAGCGAAA
CCCATTTGTG AATCGAATTT TCTGAGATTC GTTATCCACA GTGAGGAATT TGTGGCTCAT
GCAAAAACAA CGACAAGCGG AGTACGGCAT CCCAGGACAT CATGGCCATT GTTGCGTGAG
TTTTACATAT CGCTCCCCCC ACTGCCCGAG CAGAAGAAAA TCGCACACAT CCTTTCGACA
GTGCAGCGGG CAATCGAAGC ACAGGATCGG ATCATCCAGA CAACCACCGA GCTGAAAAAA
GCCCTCATGC ACAAGCTTTT CACAGAAGGG CTGCGCAACG AACCGCAGAA AGAAGCCGAG
ATTGGGCTGG TTCCGGAGAG TTGGGAGGTG GTGGAGATTG GCGACGTTTT CAAGTTCACC
AGTGGTAAGA CAAAACCAAA GGACACTGCA CCTGAGCCAT CCGTTGAGCG GACAGTTCCC
GTGTATGGAG GAAATGGAGT GCTGGGCTAT TCAGCGCAGA GCCTTCTCAA TGAGGATGTG
TTAATCCTTG GCAGGGTTGG CGAGTATTGT GGATGTGCTC ACCTCACAAA GCCTGTCTCG
TGGGTAACTG ATAACGCCCT CTACGCGAAG GAGGAGAAAC GGTCCGTGAA TCGTAGTTAC
GCGCGGACTC ATTTCGCGCA CCTCAACCTC AACCAATACA GCAACAAGAT GGGGCAGCCG
CTGATTACCC AAGGAATCAT CAATCGGGTG AAATTCGGAC TCCCGTCTCG CGAAGAACAA
GACGAACTCG CGAACGCTTT TGAAACGCTC GACACCCGTA TCGAGCAGAT CAATGCAAAA
AAGAAGTCTC TCCAAGACCT CTTCCACACA CTACTTCACG AACTGATGAC CGCGAAGATT
AATGTGGGCC ATATATCCGA AAAAATTGCA TGA
 
Protein sequence
MNTKGGGLSS PRLKNSSGLE NPPYVEGDSG GSLDMRDSNL LIESLPDRWK NHKFGDLCDR 
VKNSYQPVDG GEKPYIGLEH LAQGFPAFIG RGKECEVKSS KTVFKSGDIL FGKLRPYLRK
GAQADFDGIC STDILVFRAK PICESNFLRF VIHSEEFVAH AKTTTSGVRH PRTSWPLLRE
FYISLPPLPE QKKIAHILST VQRAIEAQDR IIQTTTELKK ALMHKLFTEG LRNEPQKEAE
IGLVPESWEV VEIGDVFKFT SGKTKPKDTA PEPSVERTVP VYGGNGVLGY SAQSLLNEDV
LILGRVGEYC GCAHLTKPVS WVTDNALYAK EEKRSVNRSY ARTHFAHLNL NQYSNKMGQP
LITQGIINRV KFGLPSREEQ DELANAFETL DTRIEQINAK KKSLQDLFHT LLHELMTAKI
NVGHISEKIA
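
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python; the `translate` helper and the table construction are illustrative and are not part of the source record. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code for elongation codons, and it does not model alternative start codons.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the standard codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAATACAA AGGGGGGCGG ACTTTCCAGT CCGCGATTAA AAAACTCAAG CGGGTTGGAA"
last_row = "AATGTGGGCC ATATATCCGA AAAAATTGCA TGA"

print(translate(first_row))  # MNTKGGGLSSPRLKNSSGLE
print(translate(last_row))   # NVGHISEKIA
```

Both outputs match the corresponding stretches of the protein sequence above. Translating rows independently works here only because every full row holds 60 bases, a multiple of three, so each row starts in frame.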