Gene Rcas_2565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2565 
Symbol 
ID5540047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3305389 
End bp3306435 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content62% 
IMG OID640894694 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001432661 
Protein GI156742532 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.472024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAACCGG TCGATAACCT TCACGTTCTT GCTTTTGAGC CGCTCACCCC GCCACGCGCC 
CTGCGTGAGC GGTATCCGAT CACCGAAGCC GCAGCGCAGA CGGTCTACGA AACGCGGGAA
TCGATCAAGC GCATTGTGCG CCGTGAAGAC CAGCGCCTGC TGGCGGTTGT CGGTCCCTGT
TCGATCCACG ACACCGGAGC GGCGCTGGAG TATGCCGGGC GACTGGCGCG CCTGGCAGGC
GAGATGCGCG ACCGAATCGT GATTGTGATG CGCGCCTACT TCGAGAAACC GCGCACCACC
GTCGGATGGC GCGGTCTGAT CAACGATCCG CACCTCGACG GTTCGTTCGA CATGAACGAA
GGATTACGGC GCGCGCGCGA GTTGTTGTTG CGCATCAACG ACATTGGGCT GCCAACCGCC
ACCGAAATGC TCGACCCGAT CAGCCCGCAG TATATTACTG ACCTGATCAG CCTGACTGCC
ATCGGCGCGC GCACCGTCGA GTCGCAGACC CATCGCGCTC TCGCCAGCGG TCTCTCGATG
CCGGTTGGCT ACAAGAACAG CACCGATGGC AATGTGCAGG TGGCAGTTAA TGCATTTCTA
TCGGCGCGCC GGGCGCACTC CTTCCTTGGC ATCGATCAGG ATGGACAGAG TTGCGTGGTG
CGTACCACCG GCAATCCCGA TGGCATGATC ATCCTGCGCG GTAGCAGCGC CGGACCGAAC
TATGATGCGG CAACCGTTGT GCGCACTGAA CAGGCGATGG AGGCGGCAGG TCTATTGCCC
GCCATCATGA TCGATTGTAG CCACGCCAAT GCGGGCGGCG ATCACACGCG CCAGCCGCAC
GTCTGGCGCG AGGTGCTACG CGACCACATC GCCAGCCGCA ACGCCGTCAT CGGTATGATG
GTCGAAAGCT ATCTGTACGA AGGGAAGCAA CCGATCCTTG CCGATCGCTC ACGGCTGCGC
TACGGCGTGT CGGTGACCGA TGCGTGTGTT GGTTGGGAAA CGACCGAGCG TATGCTGATC
GAGGCATATG AGGCGCTGAA AGGTTGA
 
Protein sequence
MQPVDNLHVL AFEPLTPPRA LRERYPITEA AAQTVYETRE SIKRIVRRED QRLLAVVGPC 
SIHDTGAALE YAGRLARLAG EMRDRIVIVM RAYFEKPRTT VGWRGLINDP HLDGSFDMNE
GLRRARELLL RINDIGLPTA TEMLDPISPQ YITDLISLTA IGARTVESQT HRALASGLSM
PVGYKNSTDG NVQVAVNAFL SARRAHSFLG IDQDGQSCVV RTTGNPDGMI ILRGSSAGPN
YDAATVVRTE QAMEAAGLLP AIMIDCSHAN AGGDHTRQPH VWREVLRDHI ASRNAVIGMM
VESYLYEGKQ PILADRSRLR YGVSVTDACV GWETTERMLI EAYEALKG