Gene Cphamn1_0750 details


Gene Information

Locus tag: Cphamn1_0750
Symbol:
ID: 6374415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 797634
End bp: 798944
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 54%
IMG OID: 642683258
Product: putative type I restriction-modification system
Protein accession: YP_001959184
Protein GI: 189499714
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
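
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with one stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(797634, 798944)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```

Both values agree with the Gene Length and Protein Length fields of this record.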


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.769779
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.272821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTCC CGCGCTATCC GAAATACAAG GCCAGCGGCG TGGAGTGGCT GGGGGAGGTG 
CCGGAGCATT GGCAGATGAT CAATAGCCGC CGGTTATTCC ACCAAGCAAA GGAATCACCC
TTAACCGACG ACATCCAACT CTCAGCGACC CAAAAATACG GGGTTGTCCC TCAGAGCCTT
TTTATGGAAA GCGACGGTAA AGTCGCTCTA GCGTTAAGCG GACTAGGGAA TTTTAAACAC
GTCGAAGTAG ATGATTTTGT GATCAGTCTC CGCAGTTTTC AAGGAGGTAT CGAGCGGAGC
AAATATTCAG GTTGCGTCAG CCCTGCTTAC ACCGTGCTGC GCCCGGCGGA ACCTATTGAT
GGAAGCTATT GGGGCTTCCT ACTGAAATCA CGCCGATACG TCGAGATCCT CCAAACGATG
AATGATGGGT TACGGGATGG CAAAAGTATC AGTTACCAGC AGTTCGGACA AATCCCCTTG
CCTTCCCCGC CCCTCGCCGA GCAAACGGCC ATTGCGGAGT TTCTGGACCG GGAGACGGGG
AAGATTGATG AGCTGGTGGC GGAGCAGCGG CGGCTGATGG AACTGCTGAA AGAAAAACGT
CAGGCTGTCA TCTCCCACGC CGTCACCAAA GGCCTCAACC CCCACGCCCC CATGAAGCCT
TCCGGCATCG AATGGCTCGG CGATGTGCCG GTGGGGTGGA GCGTGCTCAA ACTCGGAAAC
ATTTCTCGTT TTAAAGGGGG GGCAGGGTTT CCCGATAGCT ACCAAGGTCA AACAGACAAC
GAGATTCCGT TCTTTAAAGT CGGAGATATG GTGAACGCTG ACGACGCTCG CGTAATGCGG
AGAGCTAATC ACACAATCAC TGAAGCTACA GCGAGAGAGC TGCGTGCTTT TGTCTTTCCT
GAAAGCACCA TCGTGTTTGC CAAGGTTGGC GCTGCCTTAC TCCTGAAGCG ATACCGGTTA
CTTGGGCAAA GGTCTTGCAT CGACAACAAC ATGATGGGAA TGACCGTGGG GGACGGTAGT
TCGGTCGATT ATCTGCTCTA TGTTCTCCCG CTACTTGATC TCGAATTAAT TGTTAACCCC
GGGGCCGTGC CATCAATCAA TGAAGGTCAG ATTTCTGGCC AACGGATCGC ACTTCCTCCG
ATTGATGAGC AGCGAGAAAT TGTTGAATTT CTCACCTCAG TAACCGCCAA ATTCGACACC
CTCACCGCCG AAGCGCAACG CACCATCGAC CTGCTGCAAG AACGCCGCAC CGCGCTCATC
TCCGCCGCCG TCACCGGGCA GATTGATGTG CGCCAACCAC CCCGGAACTA A
 
Protein sequence
MSFPRYPKYK ASGVEWLGEV PEHWQMINSR RLFHQAKESP LTDDIQLSAT QKYGVVPQSL 
FMESDGKVAL ALSGLGNFKH VEVDDFVISL RSFQGGIERS KYSGCVSPAY TVLRPAEPID
GSYWGFLLKS RRYVEILQTM NDGLRDGKSI SYQQFGQIPL PSPPLAEQTA IAEFLDRETG
KIDELVAEQR RLMELLKEKR QAVISHAVTK GLNPHAPMKP SGIEWLGDVP VGWSVLKLGN
ISRFKGGAGF PDSYQGQTDN EIPFFKVGDM VNADDARVMR RANHTITEAT ARELRAFVFP
ESTIVFAKVG AALLLKRYRL LGQRSCIDNN MMGMTVGDGS SVDYLLYVLP LLDLELIVNP
GAVPSINEGQ ISGQRIALPP IDEQREIVEF LTSVTAKFDT LTAEAQRTID LLQERRTALI
SAAVTGQIDV RQPPRN
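
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, assuming the codon assignments of NCBI translation table 11 (the table named in this record) and a sequence pasted in the 10-base grouped layout used here. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G at each
# position, with one amino-acid letter per codon ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGCTTCC CGCGCTATCC GAAATACAAG"))  # MSFPRYPKYK
print(translate("CCCCGGAACT AA"))                     # PRN
```

The two fragments reproduce the first ten residues and the last three residues of the protein sequence listed above.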