Gene Rcas_4422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4422 
Symbol 
ID5541935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5684675 
End bp5685646 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID640896520 
Productaldo/keto reductase 
Protein accessionYP_001434456 
Protein GI156744327 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000370601 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.543562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACA TACCACTGTC TCTGCCACGT CGTCCATTGG GTCGCACCGG GTTTCAGGTG 
ACGCCGCTCT GTGTCGGGTG CGCACCGCTC GGCAATATGC CCGAAACGTT CGCTTACAGC
GTTGCCGAGG ATCAGGCAAT TGCAACCCTG CTCGAAGCCT TCCGCAGTCC GATCAATTTT
TTCGACACTG CCGCGATCTA CGGCGATGGT GAGAGTGAAC GCCGGATCGG CAAGGTCCTC
GCAATGATTG GCGGGTTGCC AGATGGCGTT GTGCTGGCAA CGAAAGCCGA CCGTGATGCG
GCAACCGGCG ATTTTAGCGG CGACCAGATC AGGCGCTCGG TCGAGCGTAG CCTGACGTTG
TTGGGTCTGG ATCGCCTGCA GTTTGTGTAC ATCCACGACC CGGAGCATAC GACGTTCGAG
AATGTTATGG GCAAAGGCGG ACCATTGGAG GTCTTGCAGC GGTTCCAGGC AGAAGGGATC
ATCGCGCATA TCGGCATTTC CGGCGGTCCG ATTGACATGC TCATTCGTTA TGTCGAAACC
GGCGCATTTA TGGCAGTTGA GACGCATAAC CGCTATACCC TGCTGAACCG CTCGGCAGAA
CCGCTCCTCG ATGTGGCAGT CAGTCGGGGT GTCGCGGTAG TGAATGCAGC ACCATATGGC
AGTGGTATTC TCGCCAAAGG ACCGGACGCT TACGCGCGCT ACGCCTATCA GGACGCGCCA
CCGGCGCTTG TTGAGCGGGT GCGCGCTATG GCGGCGGTCT GCCAGGAGTA TGGTGTTCCG
CTGGCGGCTG CGGCATTGCA GTTTTCCTTG CGCGATCCGC GCATCACCTC GACCGTTGTC
GGCGTCAGTA AGCCAGAACG CATCGCCGCC ACCCTGGACC TGGCGCGCGT CCCAATTCCC
GACGACCTCT GGCAGCGCCT TGATGCCGTG GGGTTCGACA CGAACGACCC GGAGGAGCAT
CGGTTCAAGT GA
 
Protein sequence
MIDIPLSLPR RPLGRTGFQV TPLCVGCAPL GNMPETFAYS VAEDQAIATL LEAFRSPINF 
FDTAAIYGDG ESERRIGKVL AMIGGLPDGV VLATKADRDA ATGDFSGDQI RRSVERSLTL
LGLDRLQFVY IHDPEHTTFE NVMGKGGPLE VLQRFQAEGI IAHIGISGGP IDMLIRYVET
GAFMAVETHN RYTLLNRSAE PLLDVAVSRG VAVVNAAPYG SGILAKGPDA YARYAYQDAP
PALVERVRAM AAVCQEYGVP LAAAALQFSL RDPRITSTVV GVSKPERIAA TLDLARVPIP
DDLWQRLDAV GFDTNDPEEH RFK