Gene Rcas_3120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3120 
Symbol 
ID5540616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4038966 
End bp4040117 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content64% 
IMG OID640895239 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001433192 
Protein GI156743063 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTC TCGAACTCGC TATCGCCGTG CGACCGCGCC TGCACCTGGC GCTGCCGCAG 
AAGAATCATC GTCTGTCGTC CATCGATCAA CTATTATCGC TGGCAAAGGA GAATGTCTCT
ATGATCGTCG TCATGCGCAG CAACGCAACC GAAGAGGAAC TGAACGCCGT TCTGACGCGC
ATTCAGGAGC ATGGGCTTAA AGGGCGCGTC ACCTATGGCG AAGAGCGGAA CATCGTTGGC
GTCATCGGCG CTGCCATTCC ACCGACGCTG CGGGAAGAAC TCGAGCGGTT CCCCGGCGTC
CAGGAAGCGG TGCGCATCAC CCGCCCCTAT AAACTTGCCG CGCGCGAGTT TCATCCCCAC
GACACGATCG TGCAAGTCGG CGATCTGGTG ATCGGCGGCG GTTCGTTTAT CGTGATCGCC
GGACCGTGCG CCGTCGAGAG CGAAGAGCAG ATTATGACGA CTGCGTTCGC CGTGCGCGAA
GCAGGCGCGC ATATGCTGCG CGGCGGCGCG TTTAAGCCGC GTTCGTCGCC GTACACCTTC
CGCGGATTAG GAGAGGAAGG GTTGCGTCTG CTGGCGCAGG CGCGCGCCGA GACCGGTCTG
CCGATCGTCA CCGAGGTGAT GACGCCAACC GACGTTGAGT TGGTGGCGCG CTACGCCGAT
GTGTTGCAGA TCGGCGCGCG CAATATGCAG AACTTCCAGT TGCTGGAGGA AGTCGGGCGC
AGTGGCAAAC CGGCGCTGCT CAAGCGCGGT ATGTCGGCGA CGATCGAGGA ATGGCTGCTC
TCCGCCGAGT ATATCATTGC CCAGGGCAAC CCGAATGTCA TCCTGTGCGA ACGCGGCATT
CGCACCTTCG AGACGGCGAC ACGCAACACG ATGGACCTGA ATGCGGTGGC GCTCGCTAAA
CGCCGGAGCC ATCTGCCGGT GATCGCCGAT CCATCGCACG GCACCGGCAA ATGGTACCTG
GCGCCGCCGC TGGCTCTGGC GTCGCTGGCA GCCGGCGCCG ACGGCGTGAT GCTCGAAGTG
CATCCCGACC CGGATCGGGC GACGTCGGAC GGCGGGCAAT CGTTGACCTG CGAAAACTTC
GCCGCGCTGA TGCCGCAAAT GACGGCGCTG GCAAACGTGC TGGGGCGGCG CGATGCGCGG
TGGCGGCGAT GA
 
Protein sequence
MTALELAIAV RPRLHLALPQ KNHRLSSIDQ LLSLAKENVS MIVVMRSNAT EEELNAVLTR 
IQEHGLKGRV TYGEERNIVG VIGAAIPPTL REELERFPGV QEAVRITRPY KLAAREFHPH
DTIVQVGDLV IGGGSFIVIA GPCAVESEEQ IMTTAFAVRE AGAHMLRGGA FKPRSSPYTF
RGLGEEGLRL LAQARAETGL PIVTEVMTPT DVELVARYAD VLQIGARNMQ NFQLLEEVGR
SGKPALLKRG MSATIEEWLL SAEYIIAQGN PNVILCERGI RTFETATRNT MDLNAVALAK
RRSHLPVIAD PSHGTGKWYL APPLALASLA AGADGVMLEV HPDPDRATSD GGQSLTCENF
AALMPQMTAL ANVLGRRDAR WRR