Gene Rcas_0518 details

Gene Information

Locus tag: Rcas_0518
Symbol:
ID: 5537981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 675367
End bp: 676407
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 62%
IMG OID: 640892680
Product: oxidoreductase domain-containing protein
Protein accession: YP_001430666
Protein GI: 156740537
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
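
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 675367
end_bp = 676407
protein_length_aa = 346

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1041
print(remainder)                # 0
print(expected_protein_length)  # 346
```

Under these assumptions the computed values agree with the listed gene length (1041 bp) and protein length (346 aa).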


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACA AACTGCGGGT TGGCGTATTG GGGCTGACTC ATGACCATGT GTGGAGCAAT 
CTGCGCGACC TGGCGAGCTG CGAGGATGGC ATGCTGGTCG CTGCGGCAGA CCCCAACCCC
GCGCTGCGTG AGCATATCAG GTCGTTGGGG TGCGAACAGG TTTTCGATGA CTACAGTGAT
ATGCTGGATG CAGTCAAACT CGATGCCGTG TATGTCTTCA GCGACAATCT GCGCGGCGCG
GAATTGACCG AAATGGCGGC AGCGCGTGGT CTGCATGTGA TGGTCGAGAA GCCGATGGCG
GCGACATATG CTGACGCTGC ACGTATGCGC GGCGCTGCCG CCGCCGTCGG GGTGCAGTTA
ATGGTCAACT GGCCCATCAT GTGGTGGCCC GCTGTGCAGT ATGCGCTGAA CCTGATCGCT
GAAGGGCGAA TCGGACAGGT CTGGCAAGTC AATTACCGCG CTGCCCACAT CGGTCCGCGT
GAACTCGGCT GTTCGCCGTT CTTTTGTGAA TGGCTCTACG ACCCGCGACG CAACGGCGCA
GGCGCGCTGA TGGACTACTG CTGCTACGGC GCGGCGTTGA CCTGCGCGCT GCTTGGGTTG
CCTTCGCGTG TCACGGCGCT GACCGGGCGC CTGTGGAAGC CGGACCTGCT GGCGGAAGAT
AACGCCGTGA TCGTGATGCA GCATGCACGC GCCATCAGCA CCTCCACCGC GTCGTGGACG
CAGGTCGGCG ATATGACCAG CTATGAGCCG ATGATCTATG GCAGTGAAGG GACCATTCAC
ATTCGCGAGC ATGGCCGAGA ACTGTGGCTT GCAGACCGTG AGCATCGCAA CGGCATTCGT
CTCGACATTC CTGAGCCGCC CATCGGTATG CGCAACAGCG CCGAGTATTT CCTTGGTCAC
ATTCGTTCTG GCAAGCCGAT CACCGGTCTG TGCAGCGCCG AAGTCGGGCT GATGGCGCAG
GAAGTGCTGG AGGCGGGCAT TCTTTCGGCT GCTGAAGGGC GCACCGTGTC GCTGCCGCTG
CCGCTGACGT TGCTGGCGTA G
 
Protein sequence
MADKLRVGVL GLTHDHVWSN LRDLASCEDG MLVAAADPNP ALREHIRSLG CEQVFDDYSD 
MLDAVKLDAV YVFSDNLRGA ELTEMAAARG LHVMVEKPMA ATYADAARMR GAAAAVGVQL
MVNWPIMWWP AVQYALNLIA EGRIGQVWQV NYRAAHIGPR ELGCSPFFCE WLYDPRRNGA
GALMDYCCYG AALTCALLGL PSRVTALTGR LWKPDLLAED NAVIVMQHAR AISTSTASWT
QVGDMTSYEP MIYGSEGTIH IREHGRELWL ADREHRNGIR LDIPEPPIGM RNSAEYFLGH
IRSGKPITGL CSAEVGLMAQ EVLEAGILSA AEGRTVSLPL PLTLLA
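
The relationship between the two sequences above can be illustrated with a short translation sketch. It is a minimal example, not the tool that produced this record: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for all codons (table 11 differs in which codons may act as starts; this gene begins with ATG, so that does not matter here). Only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGCGGACA AACTGCGGGT TGGCGTATTG"
last_row = "CCGCTGACGT TGCTGGCGTA G"

print(translate(first_row))  # MADKLRVGVL
print(translate(last_row))   # PLTLLA
```

The two outputs match the first and last residues of the protein sequence listed above. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the gene information.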