Gene Information |
Locus tag | Rcas_3120 |
Symbol | |
ID | 5540616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4038966 |
End bp | 4040117 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640895239 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001433192 |
Protein GI | 156743063 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
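
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 4038966
end_bp = 4040117
reported_gene_len = 1152      # bp
reported_protein_len = 383    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_len = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_len = gene_len // 3 - 1

print(gene_len, gene_len == reported_gene_len)
print(protein_len, protein_len == reported_protein_len)
```

Under these assumptions the computed lengths agree with the reported Gene Length and Protein Length fields.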
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCTC TCGAACTCGC TATCGCCGTG CGACCGCGCC TGCACCTGGC GCTGCCGCAG AAGAATCATC GTCTGTCGTC CATCGATCAA CTATTATCGC TGGCAAAGGA GAATGTCTCT ATGATCGTCG TCATGCGCAG CAACGCAACC GAAGAGGAAC TGAACGCCGT TCTGACGCGC ATTCAGGAGC ATGGGCTTAA AGGGCGCGTC ACCTATGGCG AAGAGCGGAA CATCGTTGGC GTCATCGGCG CTGCCATTCC ACCGACGCTG CGGGAAGAAC TCGAGCGGTT CCCCGGCGTC CAGGAAGCGG TGCGCATCAC CCGCCCCTAT AAACTTGCCG CGCGCGAGTT TCATCCCCAC GACACGATCG TGCAAGTCGG CGATCTGGTG ATCGGCGGCG GTTCGTTTAT CGTGATCGCC GGACCGTGCG CCGTCGAGAG CGAAGAGCAG ATTATGACGA CTGCGTTCGC CGTGCGCGAA GCAGGCGCGC ATATGCTGCG CGGCGGCGCG TTTAAGCCGC GTTCGTCGCC GTACACCTTC CGCGGATTAG GAGAGGAAGG GTTGCGTCTG CTGGCGCAGG CGCGCGCCGA GACCGGTCTG CCGATCGTCA CCGAGGTGAT GACGCCAACC GACGTTGAGT TGGTGGCGCG CTACGCCGAT GTGTTGCAGA TCGGCGCGCG CAATATGCAG AACTTCCAGT TGCTGGAGGA AGTCGGGCGC AGTGGCAAAC CGGCGCTGCT CAAGCGCGGT ATGTCGGCGA CGATCGAGGA ATGGCTGCTC TCCGCCGAGT ATATCATTGC CCAGGGCAAC CCGAATGTCA TCCTGTGCGA ACGCGGCATT CGCACCTTCG AGACGGCGAC ACGCAACACG ATGGACCTGA ATGCGGTGGC GCTCGCTAAA CGCCGGAGCC ATCTGCCGGT GATCGCCGAT CCATCGCACG GCACCGGCAA ATGGTACCTG GCGCCGCCGC TGGCTCTGGC GTCGCTGGCA GCCGGCGCCG ACGGCGTGAT GCTCGAAGTG CATCCCGACC CGGATCGGGC GACGTCGGAC GGCGGGCAAT CGTTGACCTG CGAAAACTTC GCCGCGCTGA TGCCGCAAAT GACGGCGCTG GCAAACGTGC TGGGGCGGCG CGATGCGCGG TGGCGGCGAT GA
|
Protein sequence | MTALELAIAV RPRLHLALPQ KNHRLSSIDQ LLSLAKENVS MIVVMRSNAT EEELNAVLTR IQEHGLKGRV TYGEERNIVG VIGAAIPPTL REELERFPGV QEAVRITRPY KLAAREFHPH DTIVQVGDLV IGGGSFIVIA GPCAVESEEQ IMTTAFAVRE AGAHMLRGGA FKPRSSPYTF RGLGEEGLRL LAQARAETGL PIVTEVMTPT DVELVARYAD VLQIGARNMQ NFQLLEEVGR SGKPALLKRG MSATIEEWLL SAEYIIAQGN PNVILCERGI RTFETATRNT MDLNAVALAK RRSHLPVIAD PSHGTGKWYL APPLALASLA AGADGVMLEV HPDPDRATSD GGQSLTCENF AALMPQMTAL ANVLGRRDAR WRR
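
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only: it assumes the codon-to-amino-acid assignments of translation table 11, which for elongation codons are the same as the standard genetic code, and it ignores the special handling of alternative start codons (not needed here, since the gene starts with ATG). The names `BASES`, `AMINO`, and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table in the conventional T, C, A, G ordering of each base position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO[16 * b1 + 4 * b2 + b3])
    return "".join(protein)

# First three blocks and last two blocks of the gene sequence above.
head = translate("ATGACCGCTC TCGAACTCGC TATCGCCGTG")
tail = translate("TGGCGGCGAT GA")
print(head)  # MTALELAIAV
print(tail)  # WRR*
```

The translated fragments match the first ten and last three residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.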