Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1893 |
Symbol | |
ID | 4570852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2194980 |
End bp | 2199725 |
Gene Length | 4746 bp |
Protein Length | 1581 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639766475 |
Product | hypothetical protein |
Protein accession | YP_912333 |
Protein GI | 119357689 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCATA TCGATACATC GGACGGAGAG AGCAAGCGGA ACCTCAGGGT AGCCGAGCGC TTTTTTCTGC ATACCTCCCG CAGCGCTTTT CTGTTTGCCG TTGCTGATGA TGAGGTTGTG AGGGACGATT GCAGCGCCAT TCTTCGCTTT TTACTCTCCG GCAAAGGCAA AACGCTCCGA ATTCACGAGT GGGAGAGGGA GGGAGAGGGG CTCTATCCTG CGGAACAGTT GCGAATCCTG CTGAAAAAAT ACCCTGATAC CGACGGTTTG ATTCTTATCG GCCTTGATGC GGCGCTCTAT CGCCATCCGG ATTTTCTTGA ACAGCTTAAT GTTGCCCGCG AAGCGCTCTC TTCCTTTGGT ATTCCGATGC TCTTCTGGCT GAGTAAGGAT TCATCGCAAA GGGTCAATCG GGAGGCGCTT GATCTGTTGA GTCAGCGGGC AGGCGGTATG CTGTATTTCA GCAACGGAGA TGAGCATGAA GCTGTTACAG TTCCGGATGT TCCGGCGGCG GTGTATGAAT CCGGCTGCGA CAAGGGTGTA CAGTCCGCTC TTGAAGCGCG GCTCAGGCTG TTGCAGCAGC AGCTCGAAGA GGCTGAACGG GAACAGGACG ACCCGTCGGA ACGGGCAACT GATATTGTGC TTGAGCTGTT GCGGTTATAT GCAAGGATTC CGGGAAGCAG CCGCTCCGTG CAATCGCTGC TGGAGCGGTA TTATCACCTC TTTGATCTTG AAAATCCGGA GGTATGTACG GTTGTTGCCG AAGCTCTTGC CGAGGCCGGA GATTCGGAGA GAGCGAGTTT GCTGTTTGAA AAGGCGTTGC CGTGGTACCG TGAGCAGGCG GAAACAAACC CCGAAGTCTG GCTCCCTTAC GAGGCAAAGA CGTTGACCGG TCTGGCTCGG GCGCATTGGG CCGCGGGCGA CGTTTCCGCA GCAGAGCAGG AGTACGGGGA TGCCTTGTCG ATCTATCGGA AAGTGGCGGC AGCCAATCCC CTGAACCGGA GATCTGAAAT TGCCGAAATA CTCAGTGAAC GGGCACATCT GTACTCGAAG AGTGGTGCGT TTTCTGCCGC CGGGCAGGAG TATGAAGAGG CGTTGAGACT CTATCGGGAG CTGGCCGCAG CCGATCCCCC GCGATGGATG CCGGAGGTTG CCCGGACGCT CAATAATCTC TCCACGGTAC AAACTGCACG TAACGATATG ACGACAGCGA CACTCGGATA TCAGGAGGCA CTGAAGATCA CGGCTGAACT TGACAATGCA GTAACCTGGC TGCATGTATC GGACTTTCAT CTGCGCGAGG GCGCTTTATA CGAGCAGGAG GTTATTCTTC GATCCCTTGT CGCATCGGTG AAGCGATTCC GGAAAGAGGG GTATCTGCCC GACCTGATTT TTGTCACAGG CGATATTGCC GAAAGCGGAA AGGCTGAAGA GTATGCGTTT GCAACGCAAT TTTTCGATGA CCTGCTTGCA GCTGCCGGGC TGGAAAAGAG GCGACTGTTT CTTGTGCCGG GTAACCACGA TGTCGATCGT ACGGTGAACG AGTTTTTGCC AAGAACGCTT GAAAGCGATC AAAGCTCCGA CAGGTTTTTC AATCCCGCCG GCGTGCCCAT ACGCAGTCAG TTTCAGAGGC TGCAACCCTT TTCGGACTGG TATAACAGCT ATTTTGCAGG TATCCGCACC TTTCCCGCTG ATACAACCAG CGCGGTTGAG GTTGTGCAGA TCAGAAAAAG CCTGGTTGCC ATTCTTTCGC TGAACAGTGC GCTCTTCTCT GTTGACAAGG ATGATCATGG AAAGCTCTTG CTTGGCCGTC GCTGTCTCGA CAAGGCGATA AAAGAGTTGC AGGAACATGC GGCTGACCTG AACATCGCGC TGCTGCACCA TCCGCTCGAC TGGCTGAGCC AGGTTGAGCG TGGAAACATC AGGGCAACTC TCGGCTTGTC AGTCGATCTG CTGCTGCATG GCCACTATCA TGAAACCGAT ACCGAGAGCA TTGTTTCACA ACATGGCGGA TACCTCAAAC TTGGAGCCGG GGCGGCATAC CAGACACGCG AGTGGCCGAA CCGCGCCATG TACGCAACCT TCCGGGGGAG TCAGGTGAGC ATTTTTCCCA TTCGCTATGA GGATACTCCT CTCGAACGCT GGACGCTTGA TACAGCGCTG TTTTCTTCGC CATCCTATAT CGGCACGTTT ACTCTTCCAA AACGCACGGA TTCTGATACT GCAGAGGCAA GCCCGGTTGC CGTTCCCGCC AGCGAGTTGT TTTATGACTA CCATGAGGCG CTTCGGGGGG AGCTTGAAAC CATCAATCTT CTCGGAACCC ATGCCCTCGA AAATGTTCCG GTAGGACTGA CTGATACCTT TGTTTCCCTG CGCATCTCCG ATACCTGGCG AAGCGACCGG CAGCGTGATG TTGCCGGAAC GTGCAATATG CCGGAACAGG ATGAGCAGAT CCGTGCACCT GAGGATGTGA TGCGCCTTGT GTTCGAGAAA AAACGGCTGA TGCTTATTGT TGGCGATCCC GGTTCCGGAA AGACGACGCT GCTCAAGCAT TACGCCCTCA CCGGTATCAA CAACAGGGAG AAGCTTGGGT TCAATGAGCC GCTTCTCGTG GTTTTTCTCC CCCTGCGTGA TCTTGTAATA GTTGATGGCG CCTTTGCGCC ACTTTCCGCT AACCTCGTCG CATGGTCGGA ACTGCACGAT CTTGAAATTC AGGAAAAGGC CTTCTCTTTC TGGCTTCAAA AGCGGACAAC CCTGCTTCTT TTTGACGGAC TTGACGAGAT CGCCGATCCG GAACAGCGCA TCAAAGCCTG CCGGTGGATT GACCGCATTG TCTCACGTTA TACAAGGGCG CGAGCGGTGG TAACATCCCG TGCAACAGGA TACCGGAAGG CAGAGGGGAT TGAACTTGCC TCCAGACACA CGCGGGCCGA TATTATGGAT TTTACCATCG ACCAGGAGAG AGAGTTTTTA GAGAGGTGGT TTGGCGCCGT GTACCGTGAC GAGCTGAAAC CGTCGTCCAT GCCGGAGGAA AAGTGGGAAA AACGCCGGAA AGAGAGAGCT TTAAAAAAAG CCGAAACGAT TCTTGCGTTT CTCGGCAAAG AGGAAAACAG GAGCCTGCAG ACTTTGGCCA GAGTCCCCAT GCTTTTGCAG ATAATGGCGA CTCTCTGGAA AGAGCGCGAA TATCTGCCGG GAAGCCGGGT GGAGCTCTAC GAGGCATCGC TTGCCTACAT TCTCGACTAT CGTGACCGCC AAAAAAATAT TGATCCGCTG CTCAAGGCAA AGGACGCTTT GCGCGTGCTG GCTCCGGTTT CACTCTGGAT GCAGGAGGAT CTTGGAAAAG ACGAGGCCGG CCGGGAGCAG ATGCAGGAGC AGATGCAGCT TGTACTCAAT ACGCTCACTC CAAGGCATAT GGCATCGGAG TTCTGCAAAA ACCTTGTTGA ACGTGCAGGT CTCCTTGTGG AGTACAGAGA TAACGAGTAC CTCTTTCGCC ACAAATCATT CAGGGAGTAC CTTGCCGGCC TTCAGCTCAA GGAAGACCGC GACAAGCCGA ATCGTATCGC AACACTTGCG GAGCATTTCG GCAACGACTG GTGGGCAGAG CCGCTCCGGT TTTTTATCGC ACAGGTCGAT GCACGGACCT TTGATCTCTT TATGCAAAAG CTCTTCGACT CGCCGGTCAG CGGAGAGATG ACGCAAAAGC AGCAGGATCT GCTGCAAACG CTCATCAGTG ATGCTCCGGC AAAAAAGGTC GATGCCTTGA AGCAGAAACT GCTTGACCCG CAAACCACCC TGAACCGGCA GCGCTATCTT CTGGATTCTC TTAAAGCCAT CGATCAGCCA GAAGCCAGGG AAGCCGTGCA GCAGTTTGCC TTGAAAGGTC TCAGCAAGGA GCGGCAGGTC GTTGCAGCAG TTGTGCAGGA CCGGCTTGGA GCAGAGTACA TTCTGATCGA CGGGGGAACC TTCAGATGCT CCTTTACGAA GCAGGAGGAA GAGATGCCCG CTCTCTATAT GGGAAAATAC ACGGTTACCA ACAGGCTCTA CCGTCGTTTT ATCGGCTATC TGGAAACAAA AGAGCCCGAT TTTGCCAGGA TTTTACCACT CAAGACCTAT CAGAAAAATC TCGATGCGCT TGCAGAGAGC ATCGAGGGAT TTACAGCGTG GCTGGATGCT GTAGAGCCGT TGTCAAAACG GCTTATATCC GGTTATGATG ACGACAAGCG GTTCAACGGG GACGATCATC CGGTAGTGGG CGTAAGCTGG TATGCCGCCC GAGCTTATTC CCTGTGGTTA TCGCTGCTGG AGAGCGATGG CGGGAGTAAT AACCGCTATC GCCTGCCGAC TGAAATGGAG TGGGAGTATG CCGCTGCAGG GCAGAGCGGG AGAGAGTACC CCTGGCCTGC AGAGAGGGGA GAGCCGACGC CGAAACTTGC CAATTATAAC GGGAATGAAG GAGGTACAAC ACCGGTAGGC CGCTATCCCG ACGGAGCAAC ACCCGAAGGG CTCTGCGATA TGGCCGGAAA CGTCTGGGAG TGGTGCATCG ACTGGTATAG CGACAGGTAT TATAAGGAGT GTGAAGAGAG AGGTATTGAA AGGAACCCGT TCGGACCGGA AACCGGTTCG CTCCGTGTTA TTCGTGGCGG GGGCTGGAAC TACGATGCCG GGTACTGTCG GTCGTCGTAT CGGAACATCT ACGCCCCCGG CCTCCGGCTC AGCTATGTGG GCTTCCGCCC GGTTTTCGTC CCGTAG
|
Protein sequence | MSHIDTSDGE SKRNLRVAER FFLHTSRSAF LFAVADDEVV RDDCSAILRF LLSGKGKTLR IHEWEREGEG LYPAEQLRIL LKKYPDTDGL ILIGLDAALY RHPDFLEQLN VAREALSSFG IPMLFWLSKD SSQRVNREAL DLLSQRAGGM LYFSNGDEHE AVTVPDVPAA VYESGCDKGV QSALEARLRL LQQQLEEAER EQDDPSERAT DIVLELLRLY ARIPGSSRSV QSLLERYYHL FDLENPEVCT VVAEALAEAG DSERASLLFE KALPWYREQA ETNPEVWLPY EAKTLTGLAR AHWAAGDVSA AEQEYGDALS IYRKVAAANP LNRRSEIAEI LSERAHLYSK SGAFSAAGQE YEEALRLYRE LAAADPPRWM PEVARTLNNL STVQTARNDM TTATLGYQEA LKITAELDNA VTWLHVSDFH LREGALYEQE VILRSLVASV KRFRKEGYLP DLIFVTGDIA ESGKAEEYAF ATQFFDDLLA AAGLEKRRLF LVPGNHDVDR TVNEFLPRTL ESDQSSDRFF NPAGVPIRSQ FQRLQPFSDW YNSYFAGIRT FPADTTSAVE VVQIRKSLVA ILSLNSALFS VDKDDHGKLL LGRRCLDKAI KELQEHAADL NIALLHHPLD WLSQVERGNI RATLGLSVDL LLHGHYHETD TESIVSQHGG YLKLGAGAAY QTREWPNRAM YATFRGSQVS IFPIRYEDTP LERWTLDTAL FSSPSYIGTF TLPKRTDSDT AEASPVAVPA SELFYDYHEA LRGELETINL LGTHALENVP VGLTDTFVSL RISDTWRSDR QRDVAGTCNM PEQDEQIRAP EDVMRLVFEK KRLMLIVGDP GSGKTTLLKH YALTGINNRE KLGFNEPLLV VFLPLRDLVI VDGAFAPLSA NLVAWSELHD LEIQEKAFSF WLQKRTTLLL FDGLDEIADP EQRIKACRWI DRIVSRYTRA RAVVTSRATG YRKAEGIELA SRHTRADIMD FTIDQEREFL ERWFGAVYRD ELKPSSMPEE KWEKRRKERA LKKAETILAF LGKEENRSLQ TLARVPMLLQ IMATLWKERE YLPGSRVELY EASLAYILDY RDRQKNIDPL LKAKDALRVL APVSLWMQED LGKDEAGREQ MQEQMQLVLN TLTPRHMASE FCKNLVERAG LLVEYRDNEY LFRHKSFREY LAGLQLKEDR DKPNRIATLA EHFGNDWWAE PLRFFIAQVD ARTFDLFMQK LFDSPVSGEM TQKQQDLLQT LISDAPAKKV DALKQKLLDP QTTLNRQRYL LDSLKAIDQP EAREAVQQFA LKGLSKERQV VAAVVQDRLG AEYILIDGGT FRCSFTKQEE EMPALYMGKY TVTNRLYRRF IGYLETKEPD FARILPLKTY QKNLDALAES IEGFTAWLDA VEPLSKRLIS GYDDDKRFNG DDHPVVGVSW YAARAYSLWL SLLESDGGSN NRYRLPTEME WEYAAAGQSG REYPWPAERG EPTPKLANYN GNEGGTTPVG RYPDGATPEG LCDMAGNVWE WCIDWYSDRY YKECEERGIE RNPFGPETGS LRVIRGGGWN YDAGYCRSSY RNIYAPGLRL SYVGFRPVFV P
|
| |