Gene Cpha266_1131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1131 
Symbol 
ID4570338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1282314 
End bp1283555 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content43% 
IMG OID639765727 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_911595 
Protein GI119356951 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAG TTAAACTCGG CACTTTCATC ACGATATCCA AAGGGAAGAA GCATACTCTT 
TCTGAAATGC CATCATCTCA ATCAATACGA ATGCTTGGTA TTGATGATTT GCGCAACGAC
ACATTGATCA GAATGACCGA CGATAAAGAT GGTGTGCTTG CTTGTGTCGA TGATGTACTC
ATCGCTTGGG ATGGAGCCAA TGCTGGTACT ATCGGATATG GAAAACAGGG ATATATTGGC
AGCACAATTT CTCGCCTTCG CCTGCATGAC ACATCTAAAT TTTTTGCCCC GTTCATCGGG
ATGTTTTTAC AGTCGAATTT TAGTTATTTG AGGAAAACGG CTACCGGCGC AACGATTCCG
CATATCAATC GAAACGCATT AGAAAGTATT CAGGTACCTG TTTTTACGTA TGGCGACCAA
ATCTGTATCG CAACCCTTCT TTCCAAAGTT GAGAACCTGA TCTCCCGCCG TCGTGAGCAA
CTCAAACAGC TTGACGAACT GCTCAAAAGT GTTTTTCTGG AGATGTTTGG CGATCCAATG
ATTAATCCCA AGAAGTTTCC GATAAAATTG CTTTCCGAGT TCTACATCAA TTCAAAACAT
GGCACAAAAT GTGGCCCATT TGGTAGTGCC TTGAAAAAAA ATGAATTGCT TGAATCGGGG
ATAGCCGTTT GGAATATGGA TAACATAAGC TCTTCTGGCA TAATGATATT GCCATTTCGT
ATGTGGGTGT CTGAAGAAAA GTTTCAAGAG CTACGAGCAT ATTCTGTCAT TAATGGTGAT
ATCATTATCT CTCGCGCAGG CACTGTTGGT AAGATGTGTG TTGCAAAAAC GGATGGCATA
CCTGCAATTA TCAGCACAAA CCTTATCCGA CTACGACTCA ATTCGTTACT TCTTCCGCTT
TATATTGTTT CGCTGATGAC ATACTGCAAT GGCCGTGTTG GTCGGCTTAA AACAGGGGCA
GATGGCACTT TTACACATAT GAATACGGGG ATTCTCGATA TACTTGAATT TCCATATCCA
TCGATCGAGC TTCAACGCCA ATTCGCTGAT ATCGTCGAAA AAGTGGAAAG CATTAAGGTT
TATTACCATC AAAGCCTTGC TGAACTTCAA AACCTTTATG GTACTCTCAG CCAAAAAGCG
TTCAAAGGTG AACTGGATTT TTCGCGTGTG CCGTGTTTTG CCGCTACTCA AAGAGAGGTG
AGTCGGCGAC CTGAAGGGCT TCAGTACACT GTAATGGGGT GA
 
Protein sequence
MKTVKLGTFI TISKGKKHTL SEMPSSQSIR MLGIDDLRND TLIRMTDDKD GVLACVDDVL 
IAWDGANAGT IGYGKQGYIG STISRLRLHD TSKFFAPFIG MFLQSNFSYL RKTATGATIP
HINRNALESI QVPVFTYGDQ ICIATLLSKV ENLISRRREQ LKQLDELLKS VFLEMFGDPM
INPKKFPIKL LSEFYINSKH GTKCGPFGSA LKKNELLESG IAVWNMDNIS SSGIMILPFR
MWVSEEKFQE LRAYSVINGD IIISRAGTVG KMCVAKTDGI PAIISTNLIR LRLNSLLLPL
YIVSLMTYCN GRVGRLKTGA DGTFTHMNTG ILDILEFPYP SIELQRQFAD IVEKVESIKV
YYHQSLAELQ NLYGTLSQKA FKGELDFSRV PCFAATQREV SRRPEGLQYT VMG