Gene Acid345_0281 details


Gene Information

Locus tag: Acid345_0281
Symbol:
ID: 4068825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 293829
End bp: 294884
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 60%
IMG OID: 637982282
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_589360
Protein GI: 94967312
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
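
The start, end, gene length and protein length values in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable and function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 293829
end_bp = 294884
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence has one codon per residue plus one stop codon.
codons = gene_length_bp // 3


def record_is_consistent():
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


print(record_is_consistent())
```

Under these assumptions the span is 1056 bp, which is 352 codons: 351 residues plus the stop codon.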


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCC CGACCGAAGA CTTACGCATC CAGTGGACAA AAGTTGTTCT TCCGCCGGCG 
TTCCTCGACG AGGAACTGCC GACGACGGAG AACGCTTCAG CGACCGTCGC GAACGCGCGC
AACACGGTGC GCGACATAAT TCGTGGAGAA GATTCGCGGC TGCTGGTGGT GCTGGGACCG
TGTTCCATCC ATGACGTGAA AGCGGCGCGC GAATATGCGG CGCTGCTGAA AGACGCAATC
ACCGAGCTTT CGAACGATCT CTTCCTGGTA ATGCGGGTGT ACTTCGAGAA GCCTCGCACC
ACGATTGGGT GGAAGGGGCT GATCAACGAT CCGCACCTCG ACGAGTCGTT CAACATCAAC
GACGGCCTGC GGATTTCGCG GCATTTGCTG CTGGATCTCG CGGAGATGGG TGTTCCCGCA
GGCACGGAAT TTCTTGACAT GATTACGCCG CAGTATCTTG CGGGATTGGT GTGCTGGGGC
GCGATCGGAG CGCGTACCAC CGAGAGCCAG ATCCATCGCG AGCTGGTGAG TGGACTGTCG
TGTCCGGTGG GATTTAAGAA TGGGACGTCG GGAAATGTCG GCATCGCGAT TGAGGCGGTG
CAGTCGGCAG CACATCCGCA TACGTTCCTC GGACACACGA AGTATGGGCA GTCGGCGATC
TTCGCGACCA CGGGCAATCC GGATTGCCAT GTGATCCTGC GCGGCGGACG CAAGCTGGCA
AATTACGACG CGGCGTCCGT GAAAGAAGCG TGTGGGCTGC TGGAGAAGGC TGGGTTGCCG
CAGCGCTTGA TGATCGATTG CAGCCACGCC AACAGCAACA AGGACCACAC GCGTCAGGGC
GCGGTGGCGC GCGATGTGGC TGGGCAGATT GCCGGCGGAA ACAAAGCGAT CATCGGCGTG
ATGATTGAGA GCAACCTCGT CGGTGGCGCA CAGAAGTTCG TGAAGGGCAA GCCGCTGGTC
TACGGGCAGA GCATTACCGA CGCTTGCATT GACTGGAAAG AAACGCGGGG GCTGCTGGGG
GAACTGGCGG CTGCGGTGCG CTCTCGCCGG AAGTAG
 
Protein sequence
MIRPTEDLRI QWTKVVLPPA FLDEELPTTE NASATVANAR NTVRDIIRGE DSRLLVVLGP 
CSIHDVKAAR EYAALLKDAI TELSNDLFLV MRVYFEKPRT TIGWKGLIND PHLDESFNIN
DGLRISRHLL LDLAEMGVPA GTEFLDMITP QYLAGLVCWG AIGARTTESQ IHRELVSGLS
CPVGFKNGTS GNVGIAIEAV QSAAHPHTFL GHTKYGQSAI FATTGNPDCH VILRGGRKLA
NYDAASVKEA CGLLEKAGLP QRLMIDCSHA NSNKDHTRQG AVARDVAGQI AGGNKAIIGV
MIESNLVGGA QKFVKGKPLV YGQSITDACI DWKETRGLLG ELAAAVRSRR K
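
The gene and protein sequences above are printed in blocks of ten characters. The following sketch shows one way to strip that spacing, compute GC content, and translate the coding sequence for comparison with the listed protein. It is a minimal illustration, not the method used to produce this record. The helper names (clean, gc_content, translate) are hypothetical. It also assumes that translation table 11 assigns amino acids to codons as the standard code does and differs only in permitted start codons, so a standard codon table is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence, with the page's block spacing kept.
prefix = clean("ATGATCCGCC CGACCGAAGA C")
print(translate(prefix))  # MIRPTED
```

Applying clean to the full gene sequence and passing the result to translate and gc_content allows a comparison with the Protein sequence and GC content fields of this record.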