Gene Cphamn1_1988 details


Gene Information

Locus tag: Cphamn1_1988
Symbol:
ID: 6375680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2135481
End bp: 2136611
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 49%
IMG OID: 642684479
Product: hydroxyneurosporene synthase
Protein accession: YP_001960380
Protein GI: 189500910
COG category:
COG ID:
TIGRFAM ID:
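
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2135481, 2136611)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1131 376
```

Under these assumptions the result agrees with the reported gene length of 1131 bp and protein length of 376 aa.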


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0180674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.149508
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCA CTACAGAACT GGATCGGGAT ATCTGGCATA ATCTTCAGGA GCCCGGTTCC 
TATGAGTGGT GGTATTTCGA CGCGGAAGAT GAAGAACAGG GTATTTCCCT TGTCTGTATA
TGGTTTGCCG GTTTTGCTTT CTCCCCGTAC TATATGGAGC ACTATCTGGG CTGGAAACAG
AACAGACTCG CCATTTCTCC GAAAGCACTC GATTACTCCG CATTCAGCTT TCAACTCTAC
GAAAACGGAC GTGAAAGCAT TAATTTCATC AAAGAAGGGC CCTCCTCACT CTTTGAGAGC
AGCGGAAACG ATATCGATGT CCGCTTTGAG CGTAACCGAT TCTTTTATGA CTCTCAGCGG
CAGTCCTATG TACTCGATGT GGCGTTTGAT TTTCCTGCAC GCCGCAAGAA AATCGCCGCC
AAACTGGTTT TCAGCGTCAG GCACCGGTAT TCCTACAGAA AAACAGACGG GAACAACAAC
GGGAACGTTC CGCATCACGA ATGGCTGCTG ACGCTTCCAA GAGCGGATGT GACCGGTTCG
CTCACCCTTG GCGATACGCT CAGAAAACAA TCCCGCACGC TTGAGTTTCA CGGAAGAGGA
TATCATGATC ACAATCTCGG TACCATGCCG GTGCATGAAT ATATCGATAC GTGGTACTGG
GGGCGGGCAT TTTCCGAGGA ATACGACCTC ATCTACTATA TGATTTTTTT TAAAAATTCC
GCTTACCGAC CGCTTACACT CTGTATGCTG CGCGATAACG GCACCGGAGA TCTCACGGTG
TATGAAGATC TGCGTATCGA CAAGTCAGGG CTGAGGCGCG GTTTGTTCGC TCCTGTACAC
AACAGGAATC TCGGTTTTTC CTGTGATGAT TTTCGTGTCG ATATCCGTCA GAGGCAGGTT
CTTGATTCAG GCCCGTTTTA TCTGCGATAC AGTTCGAATA TCGTGTTTCA GAAAGGTGGC
CTTGAATGCA CCTCCCTGAG AGGAATATCG GAATTTCTCA GTCCCGGACG GCTTGAGATG
TCGGCGCTCA GGTTTTTTAT CCGATGCAGG GTCTGGCGCC ACGGGGTCAG GTCTGCGATG
TACGGGATGT ATAATTTTTT CAAAACCTGT TTGCATTGGA TTAAAAGGTA A
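
The gene sequence is printed in blocks of 10 bases, so the spacing has to be stripped before any computation. The sketch below, in Python, shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and only the first row of the sequence is used to keep the example short; how the database rounds its reported value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied with its 10-base blocks.
first_row = "ATGAACATCA CTACAGAACT GGATCGGGAT ATCTGGCATA ATCTTCAGGA GCCCGGTTCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1131 bp sequence to `gc_percent` would give a value that can be compared with the reported 49%.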
 
Protein sequence
MNITTELDRD IWHNLQEPGS YEWWYFDAED EEQGISLVCI WFAGFAFSPY YMEHYLGWKQ 
NRLAISPKAL DYSAFSFQLY ENGRESINFI KEGPSSLFES SGNDIDVRFE RNRFFYDSQR
QSYVLDVAFD FPARRKKIAA KLVFSVRHRY SYRKTDGNNN GNVPHHEWLL TLPRADVTGS
LTLGDTLRKQ SRTLEFHGRG YHDHNLGTMP VHEYIDTWYW GRAFSEEYDL IYYMIFFKNS
AYRPLTLCML RDNGTGDLTV YEDLRIDKSG LRRGLFAPVH NRNLGFSCDD FRVDIRQRQV
LDSGPFYLRY SSNIVFQKGG LECTSLRGIS EFLSPGRLEM SALRFFIRCR VWRHGVRSAM
YGMYNFFKTC LHWIKR
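
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a minimal pure-Python illustration, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons, which this sketch does not model). The function and variable names are illustrative. The last row of the gene sequence begins in frame because it follows 18 full rows of 60 bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last row of the gene sequence in this record.
first_30 = "ATGAACATCA CTACAGAACT GGATCGGGAT"
last_row = "TACGGGATGT ATAATTTTTT CAAAACCTGT TTGCATTGGA TTAAAAGGTA A"
print(translate(first_30))  # MNITTELDRD
print(translate(last_row))  # YGMYNFFKTCLHWIKR
```

Both fragments match the corresponding ends of the protein sequence listed above.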