Gene Cpha266_1851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1851 
Symbol 
ID4571193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2145318 
End bp2147018 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content50% 
IMG OID639766433 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_912291 
Protein GI119357647 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCTCA TGACTACTTC TTCAGAGACC GGCTTTTGCA CCGGAAACCA GACTCCCGGA 
ATCGCTTCAG AAAAAATCTA TGCCGAAGGC ACGATCTTCC CGCTGAAAAT CGGCATGAGA
CAGATAAAGC TCAGCAAAAC CTATACAACA AAGGATCGGG AGTTCTCTTC CTTTCCGCTC
TACGATACCA GCGGCCCCTA CTCTGATCCC TCGATCGTTA CGGATCCAAA AAAAGGACTG
CTTCCGATTC GGGATACGTG GGGATTCAAT GACGGAAAAA CCTCGCCCTC TAACGACTCC
TCGCTTCCCG TAACCATCCC CGTGCGAAAC CCCCTCAAAG CAAAGGAGGG CGTTGGCATC
ACGCAAATGC ACTATGCGCG CAAAGGAGTT ATCACTCCGG AAATGGAGTA TGTCGCCATA
CGGGAGAACC AGTTGCTTGA AAGCCGGGCA TCTTCGTTTC ACGGCAATCA TAACAACGCG
AAACCTGTGA CGCCTGAATT TGTCCGGCAG GAGATTGCAT GCGGAAGAGC GATCATACCG
GCAAACATCA ATCATCCCGA ACTTGAGCCG ATGATTATCG GAAAAAACTT CCGGGTAAAA
ATCAATGCCA ATATCGGCAA CTCCGCCATG GGATCGTCAA TTGAACAAGA GGTCGAAAAA
GCTGTCTGGG CTTGCCGATG GGGAGCTGAC ACGATTATGG ATCTTAGTAC AGGAACCAGC
ATTCACAAAA CCCGTGAGTG GATAGTGAGG AACTCTCCGG TCCCTGTGGG AACCGTACCA
ATATACCAGG CACTCGAAAA AGTTGGCGGT GTTTCGGAAG CGCTCACCTG GGAGGTTTAT
CGCGATACCC TGATCGAACA GGCTGAACAG GGCGTTGATT ACTTCACCAT TCACGCAGGA
ATTCTCCTTG AGCATCTTCC CTATGCTGAA AAGCGCCTTA CCGGCATTGT ATCACGTGGA
GGCTCAATCA TGGCGAAATG GTGTCGGGCA CATAAGACGG AAAACTTTCT TTTCACCCAT
TTTGAAGATA TATGCCGCAT CCTCAAGACC TACGATATCG CCATTTCGCT CGGCGATGCC
CTCAGGCCAG GCTCAATCGG CGATGCCAAC GATGAGGCCC AGTTTGGTGA GCTTAAAGTC
CTGGGTTCCT TAACGCTCGT CGCATGGGAG CATGATGTCC AGGTAATGAT TGAAGGCCCG
GGGCACGTTC CGCTTCATCT GGTGCTTGAA AATATGGAAA AACAGCTTGA ACTCTGTCAT
GAAGCTCCAT TTTATACGTT AGGACCGCTT GTCACCGATA TCGCTGCAGG ATATGACCAC
ATCAATTCAG CAATCGGCGG CGCTCTGATT GCAAGTTACG GCTGTTCCAT GCTCTGCTAC
GTTACTCCCA AAGAGCATCT CGGACTTCCT GATAAAAACG ATGTTCGCGA AGGGGTTATC
GCTCACAGAG TTGCAGCGCA TGCCGCAGAC CTTGCAAAAG GAAACCATGC CGCATGGTTG
CGCGATGAAC TCATGAGCCG GGCCCGCTAC TCTTTTGCCT GGGAAGATCA GTTCAATCTC
TCTCTTGATC CTGAAAAAAC CCGCGAAGTT TACCGCCAGA GTATGGCTTC AAGCGTAAAC
CTGAATAAAA ACGCGGATTT TTGCACCATG TGCGGACCGG ATTTCTGTTC GATGAAAAAA
TCACGGGAGT TAAACGGGTA A
 
Protein sequence
MTLMTTSSET GFCTGNQTPG IASEKIYAEG TIFPLKIGMR QIKLSKTYTT KDREFSSFPL 
YDTSGPYSDP SIVTDPKKGL LPIRDTWGFN DGKTSPSNDS SLPVTIPVRN PLKAKEGVGI
TQMHYARKGV ITPEMEYVAI RENQLLESRA SSFHGNHNNA KPVTPEFVRQ EIACGRAIIP
ANINHPELEP MIIGKNFRVK INANIGNSAM GSSIEQEVEK AVWACRWGAD TIMDLSTGTS
IHKTREWIVR NSPVPVGTVP IYQALEKVGG VSEALTWEVY RDTLIEQAEQ GVDYFTIHAG
ILLEHLPYAE KRLTGIVSRG GSIMAKWCRA HKTENFLFTH FEDICRILKT YDIAISLGDA
LRPGSIGDAN DEAQFGELKV LGSLTLVAWE HDVQVMIEGP GHVPLHLVLE NMEKQLELCH
EAPFYTLGPL VTDIAAGYDH INSAIGGALI ASYGCSMLCY VTPKEHLGLP DKNDVREGVI
AHRVAAHAAD LAKGNHAAWL RDELMSRARY SFAWEDQFNL SLDPEKTREV YRQSMASSVN
LNKNADFCTM CGPDFCSMKK SRELNG