Gene Cphamn1_1674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1674 
Symbol 
ID6375360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1810462 
End bp1811499 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content53% 
IMG OID642684168 
Productrestriction endonuclease 
Protein accessionYP_001960074 
Protein GI189500604 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.890961 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000510843 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGTCC AGAGCGCAAC TATCCAGGCA CTAAAAGAGG CGGGCAAGCC GCTGCATGCC 
AATGAAATCG CCAAGCTGAT CATTGAAGCT GGGCTCTGGA AGTCTGATGG CAAGACACCG
GAGGCCACCG TTAGCGCCAG CCTCTATTCG GACATAAAGA AGAATGGCGA CAAGTCACTC
TTTGTAAAAG TTGGCCCTCA AACCTTCGCA CTCCGGGATG TTCAGCTTGT TGCAGTTTGC
GATATGAAGG AACCTGAACC AAAAATCGAG AGTGGCCCGG TCAGCTCCCA AGAGACGTAT
TCATTTCTGG ATGCCGCAGA AAAGGTGCTG GATCAGTTTG GAAACCGGAA CTCCATGCAT
TACCGGGACA TTACTGACAA GGCCCTGAAT CAGGGTTGGT TGAACACCAC TGGCAAGACA
CCCGAAGCAA CAATGAATGC TCAACTGGTG ACAGAGCTCA AACGGGCCAA AGCCAGCGGG
GAGCCCGGCC GATTTGTCCG CACATCACCC GGGTATTACA GCCTTGTTGA ATGGATGGGG
ACTGGCTTGC CGTATCAAAT CTCAAAGCAC AATCGAGAGA TCCGGAAAAA GCTGCTGTCT
CAGCTAATGG ATTTGAGCCC GGCTCAATTT GAGGAGCTCG TCGGACAACT GCTGGCCGAG
ATGGGCTTTG AAAGCATCGA GGTAACCAAG TACGGAGGCG ACGGTGGCGT TGATGTCCGG
GGCACGCTGC TGATCAGTGA TGTAGTCCGC ATTAAAATGG CTGTTCAAGC AAAACGCTGG
AAAGGCAATA TTCAGAGTCC GACCGTTCAA CAGGTGCGTG GCAGCCTTGG TGCGCATGAG
CAGGGCCTAA TCATCACCAC CAGCGACTTC AGCACCGGAG CCATCAAGGA GGCCAATCAG
CCGGACAAAA CCCCCGTGGG CCTCATGAAT GGCGAGCAGC TGGTCACGCT TCTGATGGAA
TACAACATAG GTGTCCGCCG CATGTCACAC GACCTTTTCG AGCTGGAGGA ACTGCCGATT
GAAAAGGGAG ATGTATAG
 
Protein sequence
MSVQSATIQA LKEAGKPLHA NEIAKLIIEA GLWKSDGKTP EATVSASLYS DIKKNGDKSL 
FVKVGPQTFA LRDVQLVAVC DMKEPEPKIE SGPVSSQETY SFLDAAEKVL DQFGNRNSMH
YRDITDKALN QGWLNTTGKT PEATMNAQLV TELKRAKASG EPGRFVRTSP GYYSLVEWMG
TGLPYQISKH NREIRKKLLS QLMDLSPAQF EELVGQLLAE MGFESIEVTK YGGDGGVDVR
GTLLISDVVR IKMAVQAKRW KGNIQSPTVQ QVRGSLGAHE QGLIITTSDF STGAIKEANQ
PDKTPVGLMN GEQLVTLLME YNIGVRRMSH DLFELEELPI EKGDV