Gene Cpha266_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0401 
Symbol 
ID4568656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp445899 
End bp446879 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content49% 
IMG OID639765001 
ProductKpsF/GutQ family protein 
Protein accessionYP_910884 
Protein GI119356240 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTCAC AACCAGAAAC CGCCGCAATA ATTGACTCGG GAAAAAATAT TCTTGAACAG 
GAAGCGCAAG CAATCCATCG GATCGCAGAC AGGCTCGATG ATAATTTCGC CCGTGCCATT
GCATTGATTC TCTCTTGTAA AGGCAAGATC ATTGTCTCCG GAATGGGCAA ATCCGGAATT
ATCGGGCAGA AAATTGCGGC GACAATGGCA TCAACCGGAA CAACGGCTCT TTTTCTGCAC
CCTGCTGACG CAGCACACGG AGATCTCGGC ATCGTTTGTT CTGGAGATAT TGTCATCTGC
CTTTCAAAAA GCGGAACAAC CGAGGAGCTC AATTACATTA TACCGGCACT GAAAAAAACC
GGAGCGTCAA TCATTGCCCT GACCGGCAAC AGCCGATCCT ATCTTGCAAA GAGCGCCGAT
ATCGTTCTTG ACACTGGTAT CGAGCAGGAA GCCTGTCCTT ATGATCTTGC GCCGACAACA
TCAACAACGG CGATGCTTGC CATGGGCGAT GCCCTATCCA TGACACTGAT GCAGGCAAAA
AACTTCACCC CCGTTGATTT CGCGCTAACC CATCCAAAAG GATCACTTGG ACGAAGACTG
ACCATGAAAG TATCGGATAT CATGGCTTCC GGCGATACCA TGCCTGTGGT TAATGAAGAT
GCAGCTGTCA CCGATCTGAT TCTTGAAATG ACCTCAAAAC GCTACGGAGT CAGCGCCATT
ATCAACAAGA AAGGGGTATT GACCGGCATT TTTACCGACG GCGACCTTCG TCGGCTTGTC
CAGAAAGGTG ACGATTTTCT GAACCTGACA GCGAGGTCGG TCATGACGGC AAACCCAAAA
ACCGTTGGAG CAGAAAGGCT TGCAACCGAG TGCCTCGAAA TTCTCGAGAC CTATCGCATT
ACACAGCTCA TTGTTTGTGA TATTGATCAG CGTCCTGCAG GCATTATCCA TATTCATGAC
CTGATTTCCC TCGGGCTGTA G
 
Protein sequence
MISQPETAAI IDSGKNILEQ EAQAIHRIAD RLDDNFARAI ALILSCKGKI IVSGMGKSGI 
IGQKIAATMA STGTTALFLH PADAAHGDLG IVCSGDIVIC LSKSGTTEEL NYIIPALKKT
GASIIALTGN SRSYLAKSAD IVLDTGIEQE ACPYDLAPTT STTAMLAMGD ALSMTLMQAK
NFTPVDFALT HPKGSLGRRL TMKVSDIMAS GDTMPVVNED AAVTDLILEM TSKRYGVSAI
INKKGVLTGI FTDGDLRRLV QKGDDFLNLT ARSVMTANPK TVGAERLATE CLEILETYRI
TQLIVCDIDQ RPAGIIHIHD LISLGL