Gene Rcas_0477 details


Gene Information

Locus tag: Rcas_0477
Symbol:
ID: 5537940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 617325
End bp: 618542
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 61%
IMG OID: 640892639
Product: ROK family protein
Protein accession: YP_001430625
Protein GI: 156740496
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
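
The start, end, and length fields above are consistent with one another if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive on the replicon.
start_bp = 617325
end_bp = 618542

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # includes the terminal stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1218 405
```

Both results match the Gene Length (1218 bp) and Protein Length (405 aa) fields.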


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.514713
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGCT TGCCGGCAAA AGCCACACGT CAGCACACCA AAAATCATAA CAGCCGCCTG 
GTCCTGCGCA TCATCTATGA GGATGAAGCG ATCAGTCGCG CCGATATTGC GCGGAAAACC
GGATTGACTC GCACAACGGT CTCGACCGTA GTCGGTGAAT TGATCGAGCA GGGGCTGATT
GAAGAAACCG GTGCAGGTCA ATCGTCGGGC GGGCGCCTGC CGATCCTGTT GCGTGTTGCG
CACGAAGCGC ACAGTGTGCT GGCATTGAGT TTCGAGGATA CGCAGATCGT TGGAGCGCTG
GTCGATATGC GCGGCAGCAT TCAGCGGCGG GTCAGTCTGT CGCTGTACGG TTATCGACCG
GAAGAGTTGC CCGGCTACCT GACTCGACTG ATCGAGGAGT TGCGCGCCGA CGTCACCACT
CATATTCTCG GCATTGGGCT GAGTATGCCG GGGATTGTCG ATCCGGTGCA TGGCATTGTG
CGACGCGCCG TTAATTTTGG TCTCGTCGAT GTTCCGCTCC GTCAGTGGCT CCAGGATCAG
TACCGGTTGC CGGTCTATCT CGACAATGCC GCACATCTGG CGGCGCTGGC TGAGTACATG
TTCGGCGACG GCGCCGCGAG CGGCAACCTG GTTGTGATCA GCATCGGCGT CGGTATCGGC
GCCGGCATGG TTCTCAACGG AGCATTGTTT CCGGGTGATG GATTCGGCGC CGGCGAAATC
GGTCATGTCG TCGTTGCCGA CAATGGCATC CGCTGCAATT GCGGCAATGT CGGCTGTCTG
GAAACCGTCG CCAGCGTCCC GGCCATTGTG CGCGCCGCGC GGAAATGGTT CAGCGACCCG
TCATCGCGCC TGCGCGCGTT GGCGCCTTCG GCGGCTGCGG TCGATCTCGA TATCGTTCAC
CGGGCGCTCG AAGCGGGAGA CCCGGGTGTC GCGTCGGTTG TCGAGGAAGC AGGGTATTAC
CTGGGGATCG CCATCGCGCA TATCGTCGGA TTGCTCAACG TCGAGCGCAT CGTGGTGACC
GGCGCCGTCG CTGTGCTTGG GTTGCCATTC CTCGAAGCGG TCAAGGTGTC GCTGGCGCGT
CATGCCCTGG CGCCCCTGGC GGCGATGACG AAGGTGGAAC TGATTCCAGA ACGGAGTGAT
GCCGTTCTGC TTGGCATCAC GGCGATGGTG CTTGATCAGG AACTGGGATT GCTGCACACC
CGCGTTCGGC TGTCGTAA
 
Protein sequence
MKGLPAKATR QHTKNHNSRL VLRIIYEDEA ISRADIARKT GLTRTTVSTV VGELIEQGLI 
EETGAGQSSG GRLPILLRVA HEAHSVLALS FEDTQIVGAL VDMRGSIQRR VSLSLYGYRP
EELPGYLTRL IEELRADVTT HILGIGLSMP GIVDPVHGIV RRAVNFGLVD VPLRQWLQDQ
YRLPVYLDNA AHLAALAEYM FGDGAASGNL VVISIGVGIG AGMVLNGALF PGDGFGAGEI
GHVVVADNGI RCNCGNVGCL ETVASVPAIV RAARKWFSDP SSRLRALAPS AAAVDLDIVH
RALEAGDPGV ASVVEEAGYY LGIAIAHIVG LLNVERIVVT GAVAVLGLPF LEAVKVSLAR
HALAPLAAMT KVELIPERSD AVLLGITAMV LDQELGLLHT RVRLS
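
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a rough illustration, not the database's own procedure. It assumes the internal codon assignments of translation table 11 are the same as the standard code (the tables differ in permitted start codons, which this sketch ignores because the gene begins with ATG). The names `clean` and `translate` are illustrative, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (both start in frame,
# since each full line holds 60 bases).
first_line = "ATGAAAGGCT TGCCGGCAAA AGCCACACGT CAGCACACCA AAAATCATAA CAGCCGCCTG"
last_line = "CGCGTTCGGC TGTCGTAA"

print(translate(first_line))  # MKGLPAKATRQHTKNHNSRL
print(translate(last_line))   # RVRLS
```

The two outputs agree with the first 20 and last 5 residues of the protein sequence listed above.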