Gene RoseRS_2359 details


Gene Information

Locus tag: RoseRS_2359
Symbol:
ID: 5209328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2920801
End bp: 2921835
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 62%
IMG OID: 640595965
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001276687
Protein GI: 148656482
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
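
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2920801
end_bp = 2921835

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1035 0 344
```

Under these assumptions the result agrees with the listed gene length (1035 bp) and protein length (344 aa).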


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.962709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTCG TCATGCGGAG CACCGCGACC GAAGAGGAGT TGAACGCAGT CCTGACGCGC 
ATCCAGGAGC ACGGTCTCAA GGGGAGCGTC ACCTACGGTG AGGAGCGGAA CATCGTTGGC
GTCATCGGAG CAGCGATTCC GCCGACGTTG CGCGAAGAAC TCGAGCGCTT TCCGGGCGTC
CAGGAAGCAG TGCGCATCAC GCGACCGTAC AAACTGGCTG CGCGTGAGTT CCACCCCCCG
GATACCATCG TCCAGGTTGG CGATGTCGCC GTTGGCGGAG GTTCGTTCGT GGTGATTGCC
GGACCATGCG CTGTTGAAAG CGAAGCGCAG ATTATGGCGA CTGCGTTCGC AGTGCGCGAA
GCGGGCGCTC ACATGCTGCG CGGCGGCGCC TTCAAACCGC GATCCTCTCC CTACACCTTT
CGCGGTTTGG GCGAGGAAGG GTTACGACTG CTGGCGCTGG CGCGCGCCGA AACAGGATTG
CCCATCGTGA CGGAAGTGAT GACGCCGACC GATGTCGAAC TGGTGGCACG CTACGCCGAT
GTGCTCCAGA TCGGGGCGCG CAATATGCAG AACTTTCAGT TGCTGGAAGA AGCCGGTCGC
AGCGGAAAAC CGGTGCTGCT CAAACGCGGG ATGTCGGCGA CGATCGAGGA ATGGCTGCTC
TCCGCCGAAT ACATTATTGC GCAGGGCAAC CCGAACGTCA TTCTCTGCGA ACGCGGCATT
CGCACATTCG AGACAGCAAC GCGCAACACG ATGGACCTGA ACGCTGTGGC GCTCGCCAAA
CGCTGCAGTC ACCTGCCGGT GATTGCCGAT CCGTCGCATG GCACGGGCAA GTGGTACCTG
GTGCCGCCGC TTGCGCTGGC ATCACTCGCC GCCGGCGCCG ATGGCGTTAT GATCGAAGTG
CATCCCGATC CGGATCGCGC CACCTCGGAC GGCGGACAGT CGCTCACATG CGAAAACTTT
GCCGCATTGA TGCCGCAGAT GACCGCGCTT GCCGGATTGC TTGGACGGAA GAGTCACGTG
TTGGTTGCAC ATTGA
 
Protein sequence
MIVVMRSTAT EEELNAVLTR IQEHGLKGSV TYGEERNIVG VIGAAIPPTL REELERFPGV 
QEAVRITRPY KLAAREFHPP DTIVQVGDVA VGGGSFVVIA GPCAVESEAQ IMATAFAVRE
AGAHMLRGGA FKPRSSPYTF RGLGEEGLRL LALARAETGL PIVTEVMTPT DVELVARYAD
VLQIGARNMQ NFQLLEEAGR SGKPVLLKRG MSATIEEWLL SAEYIIAQGN PNVILCERGI
RTFETATRNT MDLNAVALAK RCSHLPVIAD PSHGTGKWYL VPPLALASLA AGADGVMIEV
HPDPDRATSD GGQSLTCENF AALMPQMTAL AGLLGRKSHV LVAH
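
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used to produce the record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
# Codon-to-amino-acid assignments for translation table 11
# (same assignments as the standard code), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 bases of the gene sequence listed above.
first_codons = "ATGATCGTCG TCATGCGGAG CACCGCGACC"
last_codons = "TTGGTTGCAC ATTGA"

print(translate(first_codons))  # MIVVMRSTAT
print(translate(last_codons))   # LVAH
```

The two fragments reproduce the first ten and last four residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be checked the same way.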