Gene RoseRS_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1089 
Symbol 
ID5208036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1353510 
End bp1354598 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content63% 
IMG OID640594703 
ProductGntR family transcriptional regulator 
Protein accessionYP_001275447 
Protein GI148655242 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.126635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGACA CCATCATTTT CACCCGTGGC GTTCCGCCAG CCGAAGCATT TCCAACCGCG 
CAGATCGCCG AATGTCTGGC GACGGCAGTC GAGACGGATG CGGCAGTCGT TCTCCAGTAC
GGTCACCAGC CCGGCTATGC ACCGCTACGG GCGCTGCTCG CCGCCGACTA CGGCGTGAAG
GACAACGAAA TCATGGTCGC CAATGGATCA TTGCAGTTGC AGGATCTGCT GGCGGCGCAT
CTGGTGCGTC CAGGGACGAC GGTGCTGACC GAACAGCCCA GTTACGACCG CGCCATTACG
ACGTTTCGCC GTCGTGGTGC GCGGGTGGTC GGTATTCCCC TCGAAGCCGA TGGGCTTGAC
GTGGCGCGTC TCGAAGCCGA GGTCAAACGG CAGACCCCGG CATTCCTCTA CACGGTTCCC
GATTTCCAGA ACCCGGCCGG GGTGACGACA TCGCTGGAAA AGCGGCGCGC AATTCTGGAC
ATCGCCGAAC GGTATGGCTT CTGGGTGATC GAGGACATTC CGTATCGATT GCTGCGCTAC
CGTGGCGAGA GCGTGCCGAT GATGCGCGCG ATCAATCCCG GACGGGTGAT CACCATCACA
TCGTTCAGCA AACTTCTCAG TCCTGGCATG CGCGTCGGCT ATCTGGTGGC GCCGTCGTCG
CTGGTGGCGG CGGTGACGAA GGAAGCGGAG AACACGTATC TTTCGCCGGT GCTCCCGACG
CAGGCAGCAG TTGCAGAGTT TATCCGGCGC GGCTGGATGG CGCCGAACAT CGAGCGGCTG
AAAGAACTCT ACCGCCCCCG CTGGGAAGCG ATGATGAACG CGGTGCGGCG CTACCTGAGC
GGCGTCGCCG CTTCCGAACC GGATGGCGGC TTCTTCATCA GCGTCACCCT GCCGGCTGAT
GCCAATACCC GCAACCTGGT TGCACGCGCG AAGGAGATCG GTCTGGTATT GACCGAAGGG
CAGGCGTTCT TTGCCGACCC TGACGAAGGT CCGGCGCCGG ATGGCGAACG CTTTGTTCGG
CTGCCGTTCT GCGCGGTGAC GCCGGAGCAG ATCGACGAGG GCGTGCGCCG ACTGGCGTCG
CTGCTGTAA
 
Protein sequence
MHDTIIFTRG VPPAEAFPTA QIAECLATAV ETDAAVVLQY GHQPGYAPLR ALLAADYGVK 
DNEIMVANGS LQLQDLLAAH LVRPGTTVLT EQPSYDRAIT TFRRRGARVV GIPLEADGLD
VARLEAEVKR QTPAFLYTVP DFQNPAGVTT SLEKRRAILD IAERYGFWVI EDIPYRLLRY
RGESVPMMRA INPGRVITIT SFSKLLSPGM RVGYLVAPSS LVAAVTKEAE NTYLSPVLPT
QAAVAEFIRR GWMAPNIERL KELYRPRWEA MMNAVRRYLS GVAASEPDGG FFISVTLPAD
ANTRNLVARA KEIGLVLTEG QAFFADPDEG PAPDGERFVR LPFCAVTPEQ IDEGVRRLAS
LL