Gene Cphamn1_1126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1126 
Symbol 
ID6374801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1213935 
End bp1215002 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content51% 
IMG OID642683628 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001959545 
Protein GI189500075 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.963046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.386119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGGT TACAGGATAT ACGTGTATCA AATATTGAGC GGTTGACAAC TCCCCGCAGT 
TTAAAGGAGA AACTACCGGT AACGCAAGGG GTTGCCGATG TAGTGTGTCA GGGGCGTGAG
GAAGTCGAGG GTATTTTATC TGGTCGTGAT TCAAGGCTTC TTGTTGTTGT CGGGCCCTGT
TCCATCCATG ACATCAACGC GGCGATGGAG TATGCTCGCC GGTTGAAAGC GCTTCGTGAT
GAATTGAAGG ATGATCTCTG TATCATCATG CGGGTTTATT TTGAAAAGCC GAGGACGACC
ATCGGCTGGA AAGGATTTAT CAATGATCCA CACCTTGACG GGACGTTTGA CATTGAGCAC
GGTCTCTATT ATGCCCGTAA GCTGCTTCTG GACATCAACG CTCTTGGACT GCCTGCCGCA
ACCGAGTTTC TCGATCCGTT TACGCCGCAA TATGTGGCCG ATCTTGTCAG CTGGGCTGCG
ATCGGTGCAA GGACAATAGA ATCTCAGACC CATCGTCAGA TGGCCAGCGG CCTGTCGATG
CCGGTCGGGT TTAAAAATTC TACCGACGGG AGGGTACAGG CTGCCATTGA CGCGATACGT
TCGGCAATGC ACTCGCACAG TTTCCTGGGG ATCGATGCTG ACGGGCACAG CAGTGTTATT
ACAACAACCG GCAATCCGTA TGGTCATATG GTGCTTCGCG GTGGATCAGG ACGCCCTAAT
TATGACGCGG AAAATATCGC GGATGCTGAA AGACGTCTTG AAAAAGAGGG GCTTGATAAA
AACCTTCTGG TCGACTGCAG CCATGCCAAT TCAGGGAAAA ACTATGAACG TCAGTCAACA
GTATGGAACA GCATCATCGA GCAGCGGGTG ACCGGGACCG AGAGTATTCT CGGCGTTATG
CTTGAAAGTA ATCTTCTTTG TGGGAAACAG TCTGTTTCGA CTGATCCGTC ATCATTGCAG
TATGGCGTAT CGATTACAGA TGCCTGTATT TCATGGGAAG AGACCGCCAC GCTGCTGCGA
GACGGAGCGA TGAAACTTCA TCATTTTCTG TCCAGGGCGG AAGTGTAA
 
Protein sequence
MQRLQDIRVS NIERLTTPRS LKEKLPVTQG VADVVCQGRE EVEGILSGRD SRLLVVVGPC 
SIHDINAAME YARRLKALRD ELKDDLCIIM RVYFEKPRTT IGWKGFINDP HLDGTFDIEH
GLYYARKLLL DINALGLPAA TEFLDPFTPQ YVADLVSWAA IGARTIESQT HRQMASGLSM
PVGFKNSTDG RVQAAIDAIR SAMHSHSFLG IDADGHSSVI TTTGNPYGHM VLRGGSGRPN
YDAENIADAE RRLEKEGLDK NLLVDCSHAN SGKNYERQST VWNSIIEQRV TGTESILGVM
LESNLLCGKQ SVSTDPSSLQ YGVSITDACI SWEETATLLR DGAMKLHHFL SRAEV