Gene RoseRS_4327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4327 
Symbol 
ID5211311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5429765 
End bp5432854 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content67% 
IMG OID640597911 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001278615 
Protein GI148658410 
COG category[T] Signal transduction mechanisms
[R] General function prediction only 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.425746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACCA TCCTGGTACT CGGTCCGCCG CAGATCCTTG TTGATGGCGT TGCCGTGGTT 
GTGCCGCGTC GTCGCGCGCG GGCGCTGGTC TATTACCTGG CTGCGCAGCG GCGCCCCATC
AACCGCGAGC GGTTGCTTGC CCTGTTCTGG CCCGATCACG AGCGCACAGC TGCCCGGCAA
CTGCTCCGCG CCGCCCTTCA CGGCGTGCGG CAGGCGGTTG GTCCGCTCAT TGAAGGTGAG
GAAGACCTGG CTATCAGCGA TGATGTCGAG GTGGACTACC GCACGCTGGA GGATGCGGTG
ACAGCGCCTG CCGTCGATGA GGCGGCGCTG GCGAGCGCAC TGGCGCGCTA CCGCGACGAT
CTGCTGGCTG GCTTCACCCT TCCCGATGCC GCGCCGTTTA CCGAATGGCT CGCTGCCGGG
CGTGAACACG CGCGTTCGCT GGCGGTGCGC GGTTACACCC GCCTGGCGCG CGCTGCCGGA
GCGCGTGGCG ATATGGCCGC CGCCCTGACT GCTCTCGACC GTGCGCTCGC GTTCGATCCG
CTTCAGGAAG ACCTGCAACG TGAAGCTATC CGGCTGCACT ACCTGGCAGG CGACCGCGTC
GGCGCTATTC GTCGCTATGA ACAGTTCCGG GATCTGCTCG ATGCTGAACT GGGGGTTCCG
CCGATGCGCG AGACCCGTGA CATCTACGAT GCGATCGTGA CTGATCGCCT GAATGCTGAT
CTGCCGGGTG CGTGGCGCCA TGCCGAACAG CGCGAGCGCC CGATGACGTC TGGAAACGGA
CGGGATGCGA CGACACTCCC CTTCATTGGG CGCAAGGCGG AGATGGCGAC GATCGCAACG
ATTGGGGCGG GGCGTCTGGC GCTGATCGAA GGCGAAGCCG GGGTTGGCAA GACGCGCCTG
GCGTTCGAAG CAGCGGCGCA GCACACGGCG CAGGGTGGTC TGGCGCTGGT GGCAGCGGCG
CGTGAACTGG AGCAGGGGTT GCCCTATCAG CCGTGGATTG GCGCCCTGCG CGATCTGCTG
GCGCATCCCG GCTGGAGCAT GTTGCGCGCC AGTCTGGACA TCGCGCCGGT ATGGTTCGGC
GAGGTGGCGC GCCTCTTGCC GGAACTGCTG CCAGGAGCAG CGCCGCCGAC GCAGGCGGAT
GAAGCGCGCC TCTGGGAGGG GGTGGCGCGG TTACTCATCG CGCTGGCGCA GCAGAAGCCC
CTTATGATCG TGATCGACGA CCTGCACTGG GCGGATGCCA GCAGTCTCGC TCTGCTGGGG
TACGTGCTGC GGCGCGCCGG GAATGTGCCG CTGCGCGTGG TGGCGACGGT GCGCGCAGCC
GATCATCCGG CGCCGCTGCG CACACTGCTG ACGGCATTGA TCCGCGAGGG ACGGCTGGAG
CGCGTTCTGA TCCGTCGCCT GGGCGTCGCC GAGACCGAAT CTCTGGCGCG CGCCTTGAGT
CCGCGCGATG CGGCGCGTCT GGCGTCCTGG CTCTACCGCA CCACCGAGGG TAATCCGTTC
GTGATCGCCG AACTCGTGCG CCATGCCCGC GCAACCGGGT TGCTTACTTC CGATGGACGG
TTGAGTGCAA CGCTTCCTGA TGAGCCGGTT GTGCCGGCAT CGGTGTACAG CCTGATCCAG
GATCGTCTGG CGCGTCTTTC CGATGCCGCG CGGCGTGTGA TCGACACCGC TGTGGCAGTG
GGGCGCGTGT TTTCGTTCGA TGTCGTCGCG CGCGCGGCCG CTCTTTCCGA AACAGCGGCG
CTGGACGCGA TCAATGAACT GCACGCCGCC CGTTTGATTG AGCCGCTGCC CGACGGACGC
TTTCAGTTCG ACCATAGCCT GACGATGGAA GTAGCATACC GCGAAGTCGG GGAACCCTGC
CACCGGGCAT TGCACCGGCG CGTTGCTGAA ACGCTCGAAG CCCTGAACCG TGATCGGTTG
GACGAGGTGG CTGGACAGAT CGCCTGGCAT TTCATCGAGG GAGGTGCGCC GGAACGCGCC
ACACCCTATG CGCTCCGCGC CGGTCGCCAT GCCGCAAGCG TCGCTGCGTG GACGGAGGCG
ATTGCATTCT ACGAGCAGGC GCTGGTCGGC ATCCCGCAGT CCCAGCGTTT CGATGCGCTG
ATGAGCCTGG GCGAGGCGTT GTTGACCGGC GGCAGAGCGG CGCAGGCAAC CGAACGTTTC
CGCGAAAGTC TGGCGCTTGC CCGCACTCCA GCCGAGGCGC GTCGGGCGCG GTTGAGCCTG
GCGCGATCCC TCGCACCGCA GGGGCGCTAC GCTGAAATGA TCGAAACCGT GCGCGGGCTG
GAGTACGCCG ACGACTATAA TGTTCGTGTC ACTGCCCTGT TTCTGTGGGG CACGGCGCTG
TCACTGGAAG GGTCTGATCT GGCGGGCGCC GCGCTCCGTC TGCGCGAAGC GGCGCGGTTG
CTCTCATCCC AACCTGCGCC TGATCGCGTC GCGCTTGCGC AGGTACGCTT TGAACTTGGC
GGCGTGGCAG CCCAACAGGG TGACCTGCTC ACTGCACTCG CCTGCTACCG CGAGGCGCTG
GCGGTCGCCG ATAAGGCGGC GCACGATCCG GCAGCGACAA CGTGGCGCAT CCTGGCGCGC
AATAACCTGG CGTATCACCT GCACCTGTTG GGAAACCTCG ATGAGGCGGA GCGCTGGGTG
GCGGAAGGGT TGCGTCTGGC GAATGAGTAT GGTACGCCGG GACTTCAGCC CTACCTGCTT
TCCACGCAGG GTGAAATCGC CCTTGCACGC GGCGACCTGG CCGCCGCTGA AGCCAGTTTT
ATGGCGGGGT TGACGCTTGC CAGACAGATC GAACTCCCTG AACGCATTGC CGGTATCACC
GCCAATCTCG GACTGGTCGC CATGCGGCGC GGGCAGACCG CGCTTGCGAT TCATCACCTC
TCGACGGCGC TGGCGCACGC CGATACCCTG GGGACGCGCC ACCTCGCCGC CCAGATTCGT
ATCTGGCTCG CTCCGTTGAT CCCGCCCGAT GAGGCGCGCG CAGCGCTCGC AGTCGCCCGT
GCGTTTGCAA GCAATGGTGA GCGTCGCCTC CTCCTGGCTG AAATCGACCG CCTGGAACAG
ACGCTGGCGG CATGTTCACG CCCATGTTGA
 
Protein sequence
MLTILVLGPP QILVDGVAVV VPRRRARALV YYLAAQRRPI NRERLLALFW PDHERTAARQ 
LLRAALHGVR QAVGPLIEGE EDLAISDDVE VDYRTLEDAV TAPAVDEAAL ASALARYRDD
LLAGFTLPDA APFTEWLAAG REHARSLAVR GYTRLARAAG ARGDMAAALT ALDRALAFDP
LQEDLQREAI RLHYLAGDRV GAIRRYEQFR DLLDAELGVP PMRETRDIYD AIVTDRLNAD
LPGAWRHAEQ RERPMTSGNG RDATTLPFIG RKAEMATIAT IGAGRLALIE GEAGVGKTRL
AFEAAAQHTA QGGLALVAAA RELEQGLPYQ PWIGALRDLL AHPGWSMLRA SLDIAPVWFG
EVARLLPELL PGAAPPTQAD EARLWEGVAR LLIALAQQKP LMIVIDDLHW ADASSLALLG
YVLRRAGNVP LRVVATVRAA DHPAPLRTLL TALIREGRLE RVLIRRLGVA ETESLARALS
PRDAARLASW LYRTTEGNPF VIAELVRHAR ATGLLTSDGR LSATLPDEPV VPASVYSLIQ
DRLARLSDAA RRVIDTAVAV GRVFSFDVVA RAAALSETAA LDAINELHAA RLIEPLPDGR
FQFDHSLTME VAYREVGEPC HRALHRRVAE TLEALNRDRL DEVAGQIAWH FIEGGAPERA
TPYALRAGRH AASVAAWTEA IAFYEQALVG IPQSQRFDAL MSLGEALLTG GRAAQATERF
RESLALARTP AEARRARLSL ARSLAPQGRY AEMIETVRGL EYADDYNVRV TALFLWGTAL
SLEGSDLAGA ALRLREAARL LSSQPAPDRV ALAQVRFELG GVAAQQGDLL TALACYREAL
AVADKAAHDP AATTWRILAR NNLAYHLHLL GNLDEAERWV AEGLRLANEY GTPGLQPYLL
STQGEIALAR GDLAAAEASF MAGLTLARQI ELPERIAGIT ANLGLVAMRR GQTALAIHHL
STALAHADTL GTRHLAAQIR IWLAPLIPPD EARAALAVAR AFASNGERRL LLAEIDRLEQ
TLAACSRPC