Gene Rcas_4404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4404 
Symbol 
ID5541917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5655326 
End bp5656294 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content64% 
IMG OID640896502 
Productribokinase-like domain-containing protein 
Protein accessionYP_001434438 
Protein GI156744309 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.168915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.135313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATG TCGTCTCGAT GGGAGAGTTG CTGGTTGAGT TTGTGGCGAC CATCCCAAAC 
ACGCCGCTGG CGCGCGTGCC CGGTTTCATC AAGGCGCCCG GCGGCGCGCC TGCCAATGTC
GCCGTCGGGT TACAACGCCT GGGTCTCAGC GCGCGTTTCG TCGGCAAGGT CGGCGACGAT
CCGTTTGGCA TCTACCTGCG CGAGAGCCTG GCGCAGGAAG GGGTTGATAC CCGTTTTCTG
CTGGTGGACC GAAGGGCGCG CACCACGGCG GTGTTTGTAG CGGTATGGGA CGACGGGCGC
AAAGACCTCT GCTTCTACCG CAATCCTGGC GCCGACATGC TGCTTGCGCC GGATGAGATC
GACGAGCGAA TCTTCGACGG GGCGCGCTGT TTTCATTTTG GCTCAATCGG CTTCATCGAC
GAACCGTGTG CGTCGGCGCA GCGCCGCGCA CTCGAGATTG CCTGCGCGCG CGGATTAATG
ATCACCTACG ATCCGAACTA TCGCCCGACC CTCTGGCGCA ACACCGACAC CGCGCGCGCC
GTCATCCAGG ACTCATTCCG CTTCTGCCAT CTTGCCAAGA TTAGCGAAGA AGAATGGGAG
ACGGCAACCG GCGAACGCGA CCTCGACGCT GGCATCGCGG CAGTGCTGGC GAAAGGGGTC
GAACTCCTGG TCATCAGCCG GGGGGCGCGT GGCGCCATTG CGACCAATGG CGCGTATCGC
ATCGAACTCG CGCCGCCGTC CGTGCCGGTG GTGGAAACAA CCGGCGCCGG CGACGGGTTT
ATGGCGGCCA TGATCACGCG CCTGCTGCCG GAGCGTGAGC GGGTGGGGTC ACTCGCGCGC
GTCGAACCCG GTCTTGTGCG CGAAGCGTTA ATCTTCGCCA ACGCCGTTGG CGCGTTGACC
TGCACCAAAC CGGGCGCCAT TCCGGCGCTG CCAACGCGCA CCGAGGTCGA GCGGTTTCTT
CAGCAGTGA
 
Protein sequence
MADVVSMGEL LVEFVATIPN TPLARVPGFI KAPGGAPANV AVGLQRLGLS ARFVGKVGDD 
PFGIYLRESL AQEGVDTRFL LVDRRARTTA VFVAVWDDGR KDLCFYRNPG ADMLLAPDEI
DERIFDGARC FHFGSIGFID EPCASAQRRA LEIACARGLM ITYDPNYRPT LWRNTDTARA
VIQDSFRFCH LAKISEEEWE TATGERDLDA GIAAVLAKGV ELLVISRGAR GAIATNGAYR
IELAPPSVPV VETTGAGDGF MAAMITRLLP ERERVGSLAR VEPGLVREAL IFANAVGALT
CTKPGAIPAL PTRTEVERFL QQ