Gene Acid345_3110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3110 
Symbol 
ID4070224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3695932 
End bp3696954 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content65% 
IMG OID637985129 
Product3-deoxy-D-arabinoheptulosonate-7-phosphate synthase 
Protein accessionYP_592185 
Protein GI94970137 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.508417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCTG CAGCCACAGA ATCGGAAATC AACCACGTGA TCGACCGGGT GAAAGAGCTC 
GGCTACCAGG CGCACGTAAC GCGTGGCACC GAGAAGACGA TCGTCGCCGC CGTTGGGAGT
TCCGGCAACC GCGAACAACT GGCGGCGCTG GAGGCCGCGC CGGGCGTGGA GAACGTGGTC
GTCATCGCGC ACCCATTCAA GCTCGTCAGT ATGCAGGTGA AACAGAAACG GACCGTGGTG
AACGTGGGCG GCGTGCCGAT TGGCGGTGAG GCTTGCGTGC TTATGGCGGG GCCGTGCTCG
GTGGAGTCGC GAGAGCAATT GATGACCGTG GCCCATGCGA TCGCGGCGGC CGGGGCAACG
ATGCTGCGCG GCGGCGCATA TAAGCCGCGG ACCTCGCCGT ACGAGTTCCA GGGGCTGGGG
ACGGAGGCGC TGAAGCTGCT GCGCGAAGCG TCCGAGGCAA CGGGTCTGCC GGTCGTCACA
GAAGTGATGA GCACCGAGGA TGTGGACCTG GTGGCGGAGT ACGCGGACAT GCTGCAGGTG
GGCGCGCGCA ATATGCAGAA TTTCTCGCTG CTGCGACGAT TGGCGAAATG CGAGCGGCCG
ATTTTGCTGA AGCGGGCGCC GTCGGCAACA GTGAAGGATT GGCTGCTGGC GGCGGAGTAT
CTGCTGGCGG GCGGCAATAG CCAGGTGGTG CTGTGCGAGC GCGGGATTCG CTCGTACGAT
CCCGACATGC GAAACACGTT CGACCTGGCG GCGATTGCGC TGGCGAAACA GTTGTCGCAC
TTGCCGGTTG TCGCCGATCC GTCGCATGGG ACCGGACGAC GCGATCTGGT GCCGATCATG
GCGCGCGCGG CGGTGGCGGT TGGCGCGGAT GGCGTGATCG TCGAAGTGCA TCCGTGCCCG
GAGAAGGCGC TGTCGGACGG ACCGCAATCG CTGACCCTGC CGGAGTTCGA GAAGATGGTG
CAGTTGCTGG GGCAGCCGCT GCGTAGGCAT CTGCGGCAGG AATTGAAGGC GGCGACAGCG
TAG
 
Protein sequence
MSSAATESEI NHVIDRVKEL GYQAHVTRGT EKTIVAAVGS SGNREQLAAL EAAPGVENVV 
VIAHPFKLVS MQVKQKRTVV NVGGVPIGGE ACVLMAGPCS VESREQLMTV AHAIAAAGAT
MLRGGAYKPR TSPYEFQGLG TEALKLLREA SEATGLPVVT EVMSTEDVDL VAEYADMLQV
GARNMQNFSL LRRLAKCERP ILLKRAPSAT VKDWLLAAEY LLAGGNSQVV LCERGIRSYD
PDMRNTFDLA AIALAKQLSH LPVVADPSHG TGRRDLVPIM ARAAVAVGAD GVIVEVHPCP
EKALSDGPQS LTLPEFEKMV QLLGQPLRRH LRQELKAATA