Gene Cpha266_1710 details


Gene Information

Locus tag: Cpha266_1710
Symbol:
ID: 4571070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1937703
End bp: 1938947
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 49%
IMG OID: 639766293
Product: restriction modification system DNA specificity subunit
Protein accession: YP_912152
Protein GI: 119357508
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
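
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record; the conventions are assumptions (see comments).
start_bp = 1937703
end_bp = 1938947

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1245 414
```

Under these assumptions the result agrees with the reported gene length of 1245 bp and protein length of 414 aa.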


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTCG AAAATTTGCA ATGCGCTGAG GCGCGACTTG GTCAGAACAA TTGCTTGCAA 
CTTGAGGAGG ATGCGGGATG TGGGATGATG TATGAACTTC GCACAATCCA TATTGGTGAC
TTGGGTCGTG TCTTGACTGG AAAGACGCCC CCAAGTGTCC GGCCTGAACT ATTTGGAGAT
GATCACCCGT TTCTTACACC AACAGACATT GACGGTGCTT CGCGTTACAT TGAGCCCGAA
CGTTTTCTTT CGCCAGAAGG GCGCAACTAC CAGCAAAGAC TTATGCTACC TGGGCGATCC
GTTTGCGTTG TCTGTATTGG TGCTACCATT GGCAAAGTTT GCATGACTGG CAGACCGTCT
TTCACAAATC AACAAATTAA TTCCGTTGTC GTGAATGAGC AAGAGCACGA TCCGTTCTTT
GTTTATCACC TGATGACAAC GCTTCGCGAC GAGTTAAAAG CTAATGCTGG TGGGTCGGCG
ACTCCCATTA TCAATAAGAC GGCGTTTTCA GAAATCAAAG TACGTGTTCC CCCGCTTCCA
GTTCAACGGC GGATTGCGGG CATACTGTCA ACCTACGACG AACTGATTGA GAACAGTCAG
CGGCGCATCA AGATTCTGGA GGAGATGGCC CGATCAGTCT ATCGTGAATG GTTCGTTCAC
TTCCGCTTTC CCGGCCACGA AAATGTTTCG CTCGTTTCGT CTTCTCTTGG TGCTATTCCG
CAGGGGTGGG AGGCTGGTCG TTTAGACGAT GTGCTTGTTC TTCAACGTGG CTTCGATTTG
CCTAAAGCCA AGCGGATGGA GGGTACTGTG CCCATTTACG CAGCTACCGG AGTTACTGGA
TTTCACTGCG AAGCTAAGGT CAAAGCACCT TGTGTTGTGA CCGGAAGATC AGGCACAATT
GGAGATGTCA TCTATGTACA GGAAGATTTT TGGCCACTGA ATACCTCACT TTGGGCGAAG
GGTTTTCCAA AGTCGGAACC GCTTTATGCA TACTACGTGC TCTCTTCAGT TGGCTTGAAG
CAGTTCAATT CCGGGGCGGC TGTTCCGACG CTTAATCGAA ATGACCTTCA TGGTCTTGAC
GTGCTGATTC CTCCATGCGT ATTGCAAAAA CGATTTCAAA AAATTGCCGG TGCAATGTTA
TTACAAACCC GCAATCTTGA ACTGCAAATT CAAAACCTTC GTCGGACGCG CGATCTACTG
TTGCCGCGTC TGCTATCGGG GCAGGTCAAT CCCAAGGAGA ATTGA
 
Protein sequence
MKVENLQCAE ARLGQNNCLQ LEEDAGCGMM YELRTIHIGD LGRVLTGKTP PSVRPELFGD 
DHPFLTPTDI DGASRYIEPE RFLSPEGRNY QQRLMLPGRS VCVVCIGATI GKVCMTGRPS
FTNQQINSVV VNEQEHDPFF VYHLMTTLRD ELKANAGGSA TPIINKTAFS EIKVRVPPLP
VQRRIAGILS TYDELIENSQ RRIKILEEMA RSVYREWFVH FRFPGHENVS LVSSSLGAIP
QGWEAGRLDD VLVLQRGFDL PKAKRMEGTV PIYAATGVTG FHCEAKVKAP CVVTGRSGTI
GDVIYVQEDF WPLNTSLWAK GFPKSEPLYA YYVLSSVGLK QFNSGAAVPT LNRNDLHGLD
VLIPPCVLQK RFQKIAGAML LQTRNLELQI QNLRRTRDLL LPRLLSGQVN PKEN
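
The gene and protein sequences are shown in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own method, of how the blocks could be joined, the GC fraction computed, and the DNA translated for comparison with the listed protein. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stop codons appear as '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, exactly as displayed in the record.
first_line = "ATGAAAGTCG AAAATTTGCA ATGCGCTGAG GCGCGACTTG GTCAGAACAA TTGCTTGCAA"

print(len(clean(first_line)))  # 60
print(translate(first_line))   # MKVENLQCAEARLGQNNCLQ
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the reported GC content, and passing it to `translate` should reproduce the protein followed by a single `*` for the stop codon.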