Gene Sare_4777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4777 
Symbol 
ID5704444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5406888 
End bp5408033 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content73% 
IMG OID641274175 
ProductROK family protein 
Protein accessionYP_001539521 
Protein GI159040268 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000235996 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGGAGCCAC CGATCTCCGC TACGCTGCGC GACCAGACCC TCGATCTGCT CTCCAGCGGC 
GCGGCCACAT CCCGCGCCGA CCTCGTCGAG GCGTTGCAGG TCGCCCCGTC AACCGTCACC
GCCGTGGTGC GCCGGCTGCT GGAGGAGGGC GTCCTCGCGG AGGAGGGCAT GGGTCGCTCC
ACCGGTGGGC GACGCCCGCG GATCCTGCGG CTGCGAGAGA CCAAGGGAAT CCTCGCCGTC
GCAGAACTCG GCGGCCGGCA CGCCCGGGTC GGGTTGTGCA CACCCGGCGG CGAGCTGCAC
ACCACCGAGG AGGTGGCGAT CGACATCGCC GCCGGGCCCG ACGAGGTCTT CGCGGTCGTC
GGAGCCACCT TCGCGCGGCT CCAGACGGCG ACCGCACCCG GTCAGGTGCT GCTCGGGGTC
GGCGTGGCCC TCCCCGGACC GGTGGGGTTC CCCGGAGGGC GGTTGGTGGG CCCGGCCCGG
ATGCCCGGCT GGAGCGGCGT CGACGCTGGC GCCCACCTCA CCGACCGCTT CCAGGTGCCG
GTGATCGTCG AGAACGACGC CAAGGCGGCG GCGATGGGCG AGTACGTCAC CCGAGGCCCG
GAAGTCGGCG ACATGATCTA CGTCAAGGCC GGCACCGGCA TCGGCGCCTG CCTGGTCAGC
GGCGGACAGG TCCATCGTGG CGGGCGCGGC CTCAGCGGCG ACGTCACCCA CGTGCGGGTG
GCCGACAGCG GCGAGCGGCA CTGCTCCTGC GGCAGCCGGG GCTGCCTGGA GACCGTTGCC
AGCGGTGCCG CCCTGGCCCG TGAGTTGGCC GAGCAGGGTT CCTCGGCGGC CACCGTCCGG
GAGATCATCA CGGCGGTCGG CGACGCCGAC CCGACGGTCG TGACCATGGT GCGCCACGCC
GGTGGGCTGC TCGGCGTGGC GCTTTCCGGT CTGGTCAACT TCCTCAACCC CGACGCCGTC
GTCATCGGCG GTGCGCTGTC CAGCCTCGAC GTCTACGTGG CCGCGACCCG CGGCATGCTC
TACGAACGCT GCCTACCGTC CATGACCCAG TCCCTGACCA TCGAAGCCAG CGGCGCGGGC
TCGGACGCGG CCCTCATCGG CCTCGGGCAC CTGCTGCGCA CGACCGTCGA CGTCCGACCC
GCCTGA
 
Protein sequence
MEPPISATLR DQTLDLLSSG AATSRADLVE ALQVAPSTVT AVVRRLLEEG VLAEEGMGRS 
TGGRRPRILR LRETKGILAV AELGGRHARV GLCTPGGELH TTEEVAIDIA AGPDEVFAVV
GATFARLQTA TAPGQVLLGV GVALPGPVGF PGGRLVGPAR MPGWSGVDAG AHLTDRFQVP
VIVENDAKAA AMGEYVTRGP EVGDMIYVKA GTGIGACLVS GGQVHRGGRG LSGDVTHVRV
ADSGERHCSC GSRGCLETVA SGAALARELA EQGSSAATVR EIITAVGDAD PTVVTMVRHA
GGLLGVALSG LVNFLNPDAV VIGGALSSLD VYVAATRGML YERCLPSMTQ SLTIEASGAG
SDAALIGLGH LLRTTVDVRP A