Gene Rcas_0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0045 
Symbol 
ID5537503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp59089 
End bp60288 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content60% 
IMG OID640892210 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001430201 
Protein GI156740072 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.55488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTC TCGTCCTTGG CGGCGACGGC TACCTGGGCT GGCCGACCGC ATTGCACCTC 
TCGCAGCGCG GCCACGAGGT GGCTGTCCTC GACAACTTCT CACGGCGACT CTGGGATCAC
GAACTCGGCG CTGAAAGCCT GACCCCTATT GAAACGCTGC AACAGCGCGT CGCAGTCTGG
CGTCAGATCA CCGGTAAGAT CATTACCCCG TTCATTGGCG ATCTGTGCGA TTACACGTTC
CTGGAACCGG TGATCCGCGA TTTCCAACCG GAGGCGGTTG TCCATTTCGG TGAGCAGCGG
AGCGCCCCCT ACTCGATGAT CGACCGCCAG CACGCTGTGT TCACCCACGT CAACAATGTC
GTCGGCACGC TGAACCTGCT CTACGCGCTC GCCGACCATG CGCCGGATTG CCATCTGGTC
AAACTGGGAA CAATGGGAGA GTATGGCACT CCGAATATCG ACATTGAAGA GGGGTACATC
ACCATCACAC ACAAGGGACG CACCGATACC CTCCCCTACC CCAAACAACC CGGCAGCTGG
TACCACGCGA CCAAAGTCCA CGATAGCACC AACATCCTGT TTGCCTGCCG CATTTGGGGA
TTGCGCGCAA CCGACCTGAA TCAGGGGGTC GTCTACGGTG TGGAAACGCC AGAAACAACC
ATGGACCCGC GACTGGCAAC GCGCTTCGAC TACGATGGCG TTTTCGGAAC GGCGCTCAAT
CGCTTCCTGG TGCAGGCGGT CGTCGGCCTG CCACTGACGG TCTATGGAAA AGGCGGGCAG
ACGCGCGGCT TCCTCGATAT TCGCGACACG CTGGCGTGTG TCGAGATTGC CATCCTCAAC
CCGGCGCCAC GCGGTGAACT GCGCGTGTTC AATCAGTTCA CCGAGCAGTT CAACGTCGCC
GGTCTCGCCG AAGCGGTGCG CGAAGCGGCA CAGGAGTTCG GTCTCGATGT CGCCATTCAC
CACCTGCCCA ATCCACGCGT TGAGAAAGAA GAACATTACT ACAATGCCGC AAATACGCGC
CTGCTTGATC TGGGGCTAAA GCCACATTAC CTGAGTGAGA CGCTGCTCGA ATCGGTGATG
CGCGTGGTGA TGCATCACCG TGATCGGGTG CGGCCCGAAT TGATCATGCC CGCCGTCAAC
TGGCGCCGCA CGCACAATCC GGTTCTGCCA ACCGAGGAAC CGGTCATTCA GCAGCCCTAA
 
Protein sequence
MRILVLGGDG YLGWPTALHL SQRGHEVAVL DNFSRRLWDH ELGAESLTPI ETLQQRVAVW 
RQITGKIITP FIGDLCDYTF LEPVIRDFQP EAVVHFGEQR SAPYSMIDRQ HAVFTHVNNV
VGTLNLLYAL ADHAPDCHLV KLGTMGEYGT PNIDIEEGYI TITHKGRTDT LPYPKQPGSW
YHATKVHDST NILFACRIWG LRATDLNQGV VYGVETPETT MDPRLATRFD YDGVFGTALN
RFLVQAVVGL PLTVYGKGGQ TRGFLDIRDT LACVEIAILN PAPRGELRVF NQFTEQFNVA
GLAEAVREAA QEFGLDVAIH HLPNPRVEKE EHYYNAANTR LLDLGLKPHY LSETLLESVM
RVVMHHRDRV RPELIMPAVN WRRTHNPVLP TEEPVIQQP