Gene Acid345_1883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1883 
Symbol 
ID4073344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2261497 
End bp2262564 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content61% 
IMG OID637983892 
Product3-deoxy-D-arabinoheptulosonate-7-phosphate synthase 
Protein accessionYP_590958 
Protein GI94968910 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0314468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGTCG TGATGAACGC GCACGCCACC GAGGAACAGG TCGGCGCGGT CTGTAAACGG 
ATTGAGGAAC TTGGGTTCCG CGCGCATCCG ATCCCGGGAG CGCAGCGGAC TGCCATTGGT
ATTACCGGCA ACCAGGGCGA AGTCGAACCT GGCGCCATCG AAGAACTGCC TGGCGTAGTG
GAAGTCATCC GGGTCAGCAA GCCTTACAAG CTCGTAAGCC GAGATGTGAA GGAAGACAAC
ACCGTCGTCC GCTTCGCGAA CGGCGCGACC ATCGGCAGCG AAGAACTGGC GGTCGTGGCA
GGGCCGTGCG CGATCGAGAA CCACAAGCAG GCCTTTGCGA TCGCCGAGCA CGTTGCCAAA
TCGGGAGTGC GGTTCTTCCG CGGCGGGGCG TATAAGCCGC GCACCTCGCC GTATTCGTTC
CAGGGACTGG GCGAAGAGGG CCTGAAAATT ATGGCCGAGA TTCGCGACCA GTTCGGCTTG
CTGATCGTCA CCGAGGCAGT GGACAACGAG TCGCTCGACC AGGTTGAGAA ATATGCCGAT
GTGATCCAGA TCGGCGCGCG CAACATGCAG AACTTCTCGC TCCTCAAGCG TGCCGGACGC
GCTCGCAAGC CGGTGCTGCT GAAACGAGGC ATGTCGGCGA CACTGGAAGA GTTCCTGATG
GCTGCCGAGT ACGTAATGAG CGAGGGCAAC TACAACGTTG TGCTCTGCGA GCGCGGCGTG
CGGACGTTCT CCGATTACAC ACGCAATACC CTCGACCTGA GCGTGGTGCC AGCAGTGCAT
CGGTTGAGCC ATCTGCCGAT CCTTGTGGAT CCGAGCCATG GGACGGGGGT ACGCAGCAAG
GTGACGCCGT TGTCCCGCGC ATCGGTCGCC GTTGGAGCGG ATGGGCTGAT CGTGGAAGTG
CACAACGAAC CGGACCGCGC TCTCTCCGAT GGCAAGCAGT CTCTTTATCT CGAACAGTTC
GACGAACTGA TGACACAGGT TCGGCAGATC GCGCCGGTGG TTCAGCGCAA AGTCGCCGAC
CGTGGCCTGG CGCTCACGAC GCGATTGAAT TCGGCCAGCG CCCGATGA
 
Protein sequence
MLVVMNAHAT EEQVGAVCKR IEELGFRAHP IPGAQRTAIG ITGNQGEVEP GAIEELPGVV 
EVIRVSKPYK LVSRDVKEDN TVVRFANGAT IGSEELAVVA GPCAIENHKQ AFAIAEHVAK
SGVRFFRGGA YKPRTSPYSF QGLGEEGLKI MAEIRDQFGL LIVTEAVDNE SLDQVEKYAD
VIQIGARNMQ NFSLLKRAGR ARKPVLLKRG MSATLEEFLM AAEYVMSEGN YNVVLCERGV
RTFSDYTRNT LDLSVVPAVH RLSHLPILVD PSHGTGVRSK VTPLSRASVA VGADGLIVEV
HNEPDRALSD GKQSLYLEQF DELMTQVRQI APVVQRKVAD RGLALTTRLN SASAR