Gene Rcas_1222 details

Gene Information

Locus tag: Rcas_1222
Symbol:
ID: 5538689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1579902
End bp: 1581395
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 61%
IMG OID: 640893355
Product: aldehyde dehydrogenase
Protein accession: YP_001431337
Protein GI: 156741208
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
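
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1579902, 1581395)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1494 497
```

Under those assumptions the result agrees with the Gene Length and Protein Length values listed in the record.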


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAACTC CTCCAGTATA CCAGAACCTG ATCGGCGGCA AGTTTGTCGA CTCCGCAAGC 
GGACGAACGT TCGAGAACCG CAACCCGGCG GACACGCGCG AGATTATCGG CATTTTCCAG
GACAGTGATG AGCGCGACGT ACAGGCGGCG GTCGAGGCGG CGAAGCGGGC ATACCGTTAC
TGGCGGCTGG TTCCCGCGCC GAAGCGTGGC GAAATCCTGT TCAAAGCCGC GCAGCTGCTC
GTTGAGCGCA AAGAGCAGTA CGCCCGTGAC ATGACTCGCG AGATGGGCAA GGTGCTCAAG
GAAACACGCG GTGATGTGCA GGAAGCCATC GACATGTGCT TCTTTATGGC GGGTGAAGGG
CGGCGCCTCT ATGGCCAGAC AACCCCCTCC GAAATGCCAA ACAAGTTCCA GATGTCGGTA
CGCCAGCCGG TCGGCGTCTG CGGGCTGATC ACGCCATGGA ACTTCCCGAT GGCGATTCCG
TCCTGGAAGA TCCTGCCAGC ACTGATCGTT GGCAACACGG TGGTCATCAA ACCCGCCTCC
GACACGCCGC TGTCGGTGTA CAACCTGGTC CAATGCCTGC TCGACGCCGG CATTCCCGAC
GGCGTGATCA ACATCGTCAC CGGCAGCGGA AGTCGCGTTG GCGAGCCGCT GATTCGCCAT
CCCGATGTAC AAGTCATTTC CTTTACCGGT TCGACCGAGA TCGGCAGCAA AGTCGCGCGC
GTCGGGGCCG AGGGGATGAA ACACGTCTCG CTGGAGATGG GCGGCAAAAA CCCGATGATT
GTGATGGACG ACGCCAACCT CGACCTGGTC GTCGATGGCG CGATCTGGGG CGCCTTCGGC
ACGACCGGTC AGCGCTGCAC CGCCACCTCG CGGCTGATCG CCCACCGCGC CATTGTGGGC
GAACTGACCG AGCGCCTGGC GGATCGCGCT GAGCGACTGA AGATCGGCAA CGGGCTTGAT
GAAACCGTCG AGATGGGACC ATCGATCAAC CAGAGCCAGC TCGAAACGGT GCAGCGCTAC
GTCGAAATTG GTGCGAGCGA AGGGGCGCGG CTGGTTGTCG GCGGGCGGAC GCTGCGAGAT
GGCGATTACG CCTATGGGTT CTTCCATCAG CCGACGATCT TTGCCGATGT GCAGCGTCAC
ATGCGTATTG CCCAGGAAGA GATTTTCGGT CCGGTGCTGT CGATCATCAC AGTCGATAGC
CTGGAAGAGG CGATTGATGT CGCCAACGAC GTGCCGTATG GATTGTCGTC TGCGATCTAC
ACCCGCGACG TGAATGCCGC ATTTCGCGCT ATGCGCGACC TGTACACCGG CATCGTGTAC
GTGAATGCGC CAACGATTGG CGCGGAAATC CATCTCCCCT TCGGCGGCAC CAAAGGCACC
GGCAATGGGC ACCGCGAAGG CGGCATTCAG GTGCTCGACG TCTTCAGCGA GTGGAAATCG
ATCTACGTCG ATTTTTCGGG CACGCTCCAG CGTGCGCAGA TTGATAATTA TTGA
 
Protein sequence
MSTPPVYQNL IGGKFVDSAS GRTFENRNPA DTREIIGIFQ DSDERDVQAA VEAAKRAYRY 
WRLVPAPKRG EILFKAAQLL VERKEQYARD MTREMGKVLK ETRGDVQEAI DMCFFMAGEG
RRLYGQTTPS EMPNKFQMSV RQPVGVCGLI TPWNFPMAIP SWKILPALIV GNTVVIKPAS
DTPLSVYNLV QCLLDAGIPD GVINIVTGSG SRVGEPLIRH PDVQVISFTG STEIGSKVAR
VGAEGMKHVS LEMGGKNPMI VMDDANLDLV VDGAIWGAFG TTGQRCTATS RLIAHRAIVG
ELTERLADRA ERLKIGNGLD ETVEMGPSIN QSQLETVQRY VEIGASEGAR LVVGGRTLRD
GDYAYGFFHQ PTIFADVQRH MRIAQEEIFG PVLSIITVDS LEEAIDVAND VPYGLSSAIY
TRDVNAAFRA MRDLYTGIVY VNAPTIGAEI HLPFGGTKGT GNGHREGGIQ VLDVFSEWKS
IYVDFSGTLQ RAQIDNY
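
The sequences above are printed in blocks of ten letters, so they need the whitespace stripped before any computation. The following is a minimal sketch of how one might do that and estimate GC content. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input, so the printed value describes that fragment and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, used here only as sample input.
first_blocks = "ATGTCAACTC CTCCAGTATA"
dna = clean_sequence(first_blocks)
print(len(dna), round(gc_content(dna), 2))
```

Running the same two functions over the full gene sequence would give a figure to compare with the GC content field in the record.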