Gene Cpha266_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1603 
Symbol 
ID4571126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1821837 
End bp1825349 
Gene Length3513 bp 
Protein Length1170 aa 
Translation table11 
GC content45% 
IMG OID639766184 
ProductEco57I restriction endonuclease 
Protein accessionYP_912048 
Protein GI119357404 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.741912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAC AACAGGGCAA AGAACTGATC GACAAAGTCT TTACGCACTC TTTTGAGCGT 
GAGCGCTACC AAGATTTTCT GCGAAACCTT CTGAATGAGA TCGAGCCGGG TTCACGTGCC
GGAAGGCGTT ACGTTGGCCA ATGGATCAAA GCGCCATTCA GAGCATTGGT CAATCAATAC
TGGTGCTTGG GCAAATATGT CGATCCTGAT GGTGAAGAAC TCGATTTGCT GGTTGTCGAA
GTCAACTCGT TCACCAAGCT CGAACGCGCC CGCTCAGCGC TTCGCAATTT TGCAGTCAGT
TGTCTCAAAG AGTTCGGCAA AGATCAGTCA TTGATCGCAT TTTATGCCAA AGATGATGGC
GGCGCTGACT GGCGCTTTTC ATTCGTCAAG ATCGAATACA AATCCTACAA AGATAAATCG
GGAAAAATCA AACCCGGAAA AGAGCTTACT CCCGCCCGGC GTTACTCCTA CCTGGTCGGC
AAGCGCGAAA AAAGCCATAC CGCCGGCAGG CAGTTGATGC CGATTCTGCT CATGGATTCG
GTCAATCCGC GCCTCGAAGA GATCGAACAG GCATTTTCCG TTGAGACGGT CACAGACGAG
TTTTTCGAGC AGTACAAGGC GCTCTTTCGC AATCTGAGCG ATCACCTGCA AACGCACCCA
TGGTTCCAGC GAGCCGATGA GATCGAAACG GAAGAGCAGG TCAGCCGCTT TGCCAAAAAA
CTGCTCGGTC AGATCGTCTT TCTCTATTTT TTACAGAAAA AAGGGTGGCT TGGAGCACCC
GAAAGCGGCA AATGGAAGGA TGGATCGAAA CGATTTCTGC GCGATTTGTT CGACCATGCC
GTTGAGTCCG GTGTGAACTA CTATAGCGAT GTACTCTGCT ACCTCTTTTA CGAAGCGCTT
GCCTGCGAGC GCAAAGGCCC GGTTGATTCC GGTTATTACG AGCGCTTCGA CTGCCGGGTA
CCATTTCTGA ACGGTGGCTT GTTCGAGGCC GGCGACTACG ACTGGCAGAA CGCTTCCCTC
GATATCCCCG ATGCCTTCTT TCACAACCAG GAAAAGACCA AAGCAGGCGA TACCGGCACC
GGAATTCTCG ATGTGTTTGA CCGCTACAAC TTCACCATAC GCGAAGACGA ACCCCTCGAC
AAGGAGGTTG CCGTCGATCC CGAAATGCTC GGCAAGGTAT TTGAAAAAAT GCTCTCCGAC
GGCGAACGCA AAGGCAAAGG CGCATTCTAC ACGCCTCGCG AAATCGTCCA TTACATGTGT
CAGGAAAGCC TTGTCCAATA CCTTGACAAC CGCATCAACG CTATCCCGAC CGGCTACGAA
GAACTCGGCC TGTCGCAGAT CAGTATGTTT GCCGACAGCA CACCAACAGG GCAAATTGCC
ATGACCATCG AGCATCATGG AATTCGCGTG CCGAGAAAGG ATATCGAAAA GCTGATACGC
AAGGGATACC TTTTTGTCCA GAACGATGAA GTGGCAATGG CTGCACTGCA TCGTGTCGAG
CAAGGTGATC AGAAAACAAC GAAACATACC GTTGAGCTGC CGAATACGGT CAGAATCCAT
GCCAGAGATA TCGATGCAGC ATTGGCCGAC ATCAAGATCT GCGACCCCGC CATTGGCTCC
GGTGCATTCC CCGTCGGCAT GTTGCACGAA ATCGTGGCCG CCCGCCAGGC GCTCGCCCAT
CACACCGGCA ATACGAGCAG TTCCTACGTT CTGAAACGCC ACGCCATCGC CGAAAATCTC
TATGGCGTTG ATGTTGACCC GTCGGCTATC GACATTGCTC GCTTGCGGCT CTGGCTCTCG
CTCATCGTTG ACGAAGAGGA TTACGGCACC ATCGAACCCC TGCCGAACCT CGACTACAAA
ATTATTCAGG GGGATTCGCT CATCGGCATC GACATCGATC CGCTTTTCAA CAAAAAGTTG
TTGGATGCTA TTGAAGACAA AAAGAAAGCG TTTTTCGATA CCACGGATCA CGGTATAAAG
GAAAGGCTTG GCAAAGAGAT CGATACGCTG ATTCATGAAA TAACGAACGG GAAGAAGCAC
TTCGATTTTA GAGTGTATTT CTCTGAAGTC TGGCATCTGA AAGATGGGTT CGATGTGGTC
ATTGGCAATC CGCCATACTT TAATATTGAT ATTTACGGCG CAGGATCTCC AATGCAAGCT
TACCTAAAGA AGAATTACTC TGTCTATATG GATAAGAGTG ATATTCTTTT TTATTTTTTT
GAAAAAGCTG TTTTGATTTC AAAAGGTATT GTTGCTTTTA TTACATCTAA TGCATTTCTG
AATGCAGAAA AGGCAAATAA ATTGAGGGGG TATTTGATTA ATAATAATCT TGTTAGAAGG
ATTATTAATT TTGAAAATCT TATGATTTTT CAGGAAGCCA GCATTACAAC CGCAATTACT
TTATTGTATA AGTCACAACA AAAAGCAAGA TATCTGAATT TAAACGCTTC AAAATATTCC
GATGGCGAGC TTGAGAAACT GATGAATATC GAAACTAATT ATATTGAGGC ATCTGATTGG
AATGCTGATG GATTTACTTT AATAGATAAT CATCATAAGC AAATTATTGA TAAGATTGAA
AATGGAAAGT CAAAGCTTAA ACAGCTATAT AAGATTGGAA GCGGAATGCA GACCGGCTGT
AATGATGTCT TTGTTTTTAA GAAAATACCA AGCGGTTTTC CGGAATATCT TTTAAAGAAA
AGAATGAGCG GAGAGAATAT TAATAAATAT TCTTTTTCAG AAAATTCAGA ATGGATTCTT
TATGTTGAAA ACGTTGATTT GTTTGAAAAA TTGCACGAAT CAGTAAGACA ATACCTTTTG
CTTAATCAAG ATAAGTTACG CAAAAGGGCT GATAAAAAAA GGAGAACTAC AGCTAAATGG
TGGAACTTTA CCTTTGCGAT GCATAAAGAG TATTATGGGT ATGATAAGAT CTGGTGTTCA
TATAGAGCAA ATAGTAATGT ATTTTGTTTT GATGATAAGG GCGAACATGT AGGTCTTACG
AATACCACTG TTATTTTTAA TACTAATAAT AACATATTAT TCAAATATCT TCTCGCTTTG
CTTAATTCAG AGGTGCTAAC ATTTAGATAT TATTTTATAG GCAAAAAAAC CGGGAATAAT
CTTTTAGAGT ATTTTGAAAA TGGGGTTGGT CGTCTTCCAA TAGCATTGGC AGAAAGACAA
GTTCAAGAAT ATATTTCAAT TTTTGTGGAC TATGTGAGAT TCTTATCCGA TAAGAATAAG
AATAATATGA TGACATACTT GATTGGGGTT ATCGATTGCC TCGTCTATGA ACTCTACTTT
CCTGATGAAC TCAAAGCCGC CGGGAAAGAG CTTCAGCCTC ATATCGGCAA TCTGGCGCCG
CTTGCAGACG ACATGCCCGA AGAACAAAAA CTCGCCATCA TCCAGCGCGA ATTCAACCGT
CTCTCTGATC CCTCTCACCT TGTTCGCCAG CGTCTCGACA GCCTCGACGA AGTGAATGTT
GTGCGGACTG TACGAGCAGC GCTGAGGAGG TGA
 
Protein sequence
MNKQQGKELI DKVFTHSFER ERYQDFLRNL LNEIEPGSRA GRRYVGQWIK APFRALVNQY 
WCLGKYVDPD GEELDLLVVE VNSFTKLERA RSALRNFAVS CLKEFGKDQS LIAFYAKDDG
GADWRFSFVK IEYKSYKDKS GKIKPGKELT PARRYSYLVG KREKSHTAGR QLMPILLMDS
VNPRLEEIEQ AFSVETVTDE FFEQYKALFR NLSDHLQTHP WFQRADEIET EEQVSRFAKK
LLGQIVFLYF LQKKGWLGAP ESGKWKDGSK RFLRDLFDHA VESGVNYYSD VLCYLFYEAL
ACERKGPVDS GYYERFDCRV PFLNGGLFEA GDYDWQNASL DIPDAFFHNQ EKTKAGDTGT
GILDVFDRYN FTIREDEPLD KEVAVDPEML GKVFEKMLSD GERKGKGAFY TPREIVHYMC
QESLVQYLDN RINAIPTGYE ELGLSQISMF ADSTPTGQIA MTIEHHGIRV PRKDIEKLIR
KGYLFVQNDE VAMAALHRVE QGDQKTTKHT VELPNTVRIH ARDIDAALAD IKICDPAIGS
GAFPVGMLHE IVAARQALAH HTGNTSSSYV LKRHAIAENL YGVDVDPSAI DIARLRLWLS
LIVDEEDYGT IEPLPNLDYK IIQGDSLIGI DIDPLFNKKL LDAIEDKKKA FFDTTDHGIK
ERLGKEIDTL IHEITNGKKH FDFRVYFSEV WHLKDGFDVV IGNPPYFNID IYGAGSPMQA
YLKKNYSVYM DKSDILFYFF EKAVLISKGI VAFITSNAFL NAEKANKLRG YLINNNLVRR
IINFENLMIF QEASITTAIT LLYKSQQKAR YLNLNASKYS DGELEKLMNI ETNYIEASDW
NADGFTLIDN HHKQIIDKIE NGKSKLKQLY KIGSGMQTGC NDVFVFKKIP SGFPEYLLKK
RMSGENINKY SFSENSEWIL YVENVDLFEK LHESVRQYLL LNQDKLRKRA DKKRRTTAKW
WNFTFAMHKE YYGYDKIWCS YRANSNVFCF DDKGEHVGLT NTTVIFNTNN NILFKYLLAL
LNSEVLTFRY YFIGKKTGNN LLEYFENGVG RLPIALAERQ VQEYISIFVD YVRFLSDKNK
NNMMTYLIGV IDCLVYELYF PDELKAAGKE LQPHIGNLAP LADDMPEEQK LAIIQREFNR
LSDPSHLVRQ RLDSLDEVNV VRTVRAALRR