Gene Rcas_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1559 
Symbol 
ID5539035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2001446 
End bp2002507 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content62% 
IMG OID640893697 
Producthomoserine dehydrogenase 
Protein accessionYP_001431670 
Protein GI156741541 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACCT ATCGCCTCTT CCTCAGCGGT CTAGGCAATG TGGGCCGCAG TTTTCTTGCC 
ATCATGCAGT CACAGGCGTC GCTCCTCGCC AGGCGCTATG GCGTTGCGCT GCGCCTGGTT
GCCGCAGCCG ACTCAGGGGG CGCGGCAATC TATGCGACTG GACTGGACCC GGCGACGATC
CTGGCGCTCA AACAGCGTGG TCAGAGCATT GCGGCGCTGC CGGAAAGCGG CATCTTCGGA
ATTGCTCCTG TTGAAGTCAT TCAGCGCATC GAAGCGGACA TCCTCCTCGA AGCGACGCCG
GTCAATTTGA AAACAGGGCA ACCGGGTCTT GATACCGTGC GCGCAGCGTT GCGGCGCGGT
ATGCACGCAG TGCTGGCAAA TAAAGGACCG CTGGCGCTCG CATATACAGA ACTGGCTGAC
CTCAGCGATA TGGGAGAGGC CACTGAGGAA CGCGGCGATC CGCGCGACTG GCCCGCGCTG
CGCTTCAGCG CCTGCGTCGG CGGCGCGTTG CCGACCATTG CAATTGGGCG ACGCGACCTG
GCAGGCGCGA CGATTGTGCG CGTCGAGGCG GTGCTCAACG GCACAACGCA GGGCATCCTG
CGCGCGATGG AACAGGGGAG TTCATATGCT GACGCACTGG CGGAGATGCA ACGACGCGGA
CTGGCGGAGA CCGACCCTTC CCTCGATGTC GAAGGTTGGG ATGCCGCTAG TAAACTGACT
ATTCTTGCCA ATGCGGTGTT GCGCCAACCC ACAACGCTGG CTGATGTCGC CGTGCGCGGC
ATTACGGACC TGACCACCGG AGACCTGCGC GCCGCACTGG ATCGCGGCGA ACGTATTGTG
CTCCTCTGCC TGGCGGAGCG TCGAGGCGGC GACTTCCACC TGAGCGTTCA ACCAACATCA
TTGCCGCTGA TCCATCCATT GGCGCGGATG AGCGCCGATG AGATGGGCGT AGTCTACTAC
ACCGACATCA CCGGCAGACA GACTGCGACA ACGCTGGAAA CCGATCCGAC TCCGACTGCC
GCAGCCATGC TGCGCGACAT TCTTGATATT GCTGCGCGTT GA
 
Protein sequence
MRTYRLFLSG LGNVGRSFLA IMQSQASLLA RRYGVALRLV AAADSGGAAI YATGLDPATI 
LALKQRGQSI AALPESGIFG IAPVEVIQRI EADILLEATP VNLKTGQPGL DTVRAALRRG
MHAVLANKGP LALAYTELAD LSDMGEATEE RGDPRDWPAL RFSACVGGAL PTIAIGRRDL
AGATIVRVEA VLNGTTQGIL RAMEQGSSYA DALAEMQRRG LAETDPSLDV EGWDAASKLT
ILANAVLRQP TTLADVAVRG ITDLTTGDLR AALDRGERIV LLCLAERRGG DFHLSVQPTS
LPLIHPLARM SADEMGVVYY TDITGRQTAT TLETDPTPTA AAMLRDILDI AAR