Gene Cpha266_2628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2628 
Symbol 
ID4568744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp3014774 
End bp3016171 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content49% 
IMG OID639767192 
Producthypothetical protein 
Protein accessionYP_913039 
Protein GI119358395 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.945312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA ATAAAAGGAT TCGGCAGAAT AAGAAACAGC ATATTCAGGA AATTCTTCCA 
ACAAATGACA AGTTGACCGG CAGGGCTGGC TTGAGCCTGT TTGCCCTGTA TCTGCGCAAT
ATCCAATTTT TTCCGATCGT TGATCGCATG TTTGGCAGCA TGCGCAAGAA CAGCAAGGGA
TTGCCGATCA CCGAACTGTT CGTTCAAATG CTGAGCTTTT TCATGGATGG AACGAGTCGT
CATCTGGTCT GGTTCGACCA GCTTAAGGCT GATGAGAGTT ATTCGGCTGT CCTTGGTTCC
GAACGATTGG CTTCATCGCA TACCATGAAA CGGTTCTTTG GTGCATTTTC CTTTCGGCGA
GTCTACCTGT TCAGGAAGTT GTTGCAGGAT CTGTTCATCT GGCGGCTGAA CCAAACAAAA
CCCAAGGTTA TTGTGCTTGG CCTCGATACG ACGGTCTTCG ACAACAATGA TGCCGAAAAA
CGTCACGGCG TTGAACCCAC GTATAAAAAG GTCAAAGGGT TCCAGCCCCT GCAACTGAAT
TGGGGCCGTT ATGTGGTAGA CGCGGTGTTC CGTGGCGGCA AGAAGCACTC CAATCATGGC
GATACGGCCG AAAAGATGCT GCGGCATATG GTAGGGAAAA TCCGGACAGC ATACCGGGAA
GATGTTCTCA TCATTGTGCG TATGGACGCA GGGTTTTACG ACGACCAGAT CTTCAACGTC
TGTGAAGAAC TGGAGATCGG GTATCTGTGT GGCGGTAAAC AATATGCCAA CGTAATCGAT
GAAGCATCAG AGAGCATTGA TTGGCAAGCC TACAAGAAAG TAACTGATGA ACGGACAAGC
TGGATGTATA CGGATTTCAT GTGCAAACAG AAGACGTGGA AGAAAGAACG GCGGACAATC
TTCAGCACAC TTTGGGAAGA CAACGGGCAG TACTTACTCG ACGGGTTATG TCGGGATACG
GTGATCATTA CCAACATTGG CAAGGGAGAA CCAATCGACA AGCAGCTCAG CGCCATCGAA
GAAGAGCAGT GGTTCAAAGC CGAAACGATT CTGGCCCGTT ATCACGATCG GGGAACGGAT
GAACTCACTA ACCGGGCACT GAAAACCTTT GGTCATGAAC AATTGCCCTT CAAACGATTT
CCGGCAAACG CAGCATGGTA CTATCTGATG CTGCTGGGCA ACAACCTCTT TGAATCCTTC
AAGGAAGACG TGACAGCATC CGTTATATCG GTGTCGGTCT ATGCTCATAC CTTTCGTCGA
CAGTTCATCG ATACCGCCGC TCAGATCGTT TGTCATTCAG AAAAGGTGCA GATAAAAGTC
CCGAGAGCAG CTTATGAGCG GCTCCAATTC GGTAAGCTCT TTGAGATATG CCGGAATCGC
TTGCCACAAC TCTGTTAG
 
Protein sequence
MSKNKRIRQN KKQHIQEILP TNDKLTGRAG LSLFALYLRN IQFFPIVDRM FGSMRKNSKG 
LPITELFVQM LSFFMDGTSR HLVWFDQLKA DESYSAVLGS ERLASSHTMK RFFGAFSFRR
VYLFRKLLQD LFIWRLNQTK PKVIVLGLDT TVFDNNDAEK RHGVEPTYKK VKGFQPLQLN
WGRYVVDAVF RGGKKHSNHG DTAEKMLRHM VGKIRTAYRE DVLIIVRMDA GFYDDQIFNV
CEELEIGYLC GGKQYANVID EASESIDWQA YKKVTDERTS WMYTDFMCKQ KTWKKERRTI
FSTLWEDNGQ YLLDGLCRDT VIITNIGKGE PIDKQLSAIE EEQWFKAETI LARYHDRGTD
ELTNRALKTF GHEQLPFKRF PANAAWYYLM LLGNNLFESF KEDVTASVIS VSVYAHTFRR
QFIDTAAQIV CHSEKVQIKV PRAAYERLQF GKLFEICRNR LPQLC