Gene RoseRS_3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3020 
Symbol 
ID5209988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3798319 
End bp3799377 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content60% 
IMG OID640596612 
Producttranscriptional regulator-like protein 
Protein accessionYP_001277334 
Protein GI148657129 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.117802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAAG GGTCTCGACG TGGCGGCAAT AAGCGCAGTT CTCAACTCAT CCATCGGCGG 
CGTCTTTTTC TGGTGCAACG TCTCATTCGC GGATCGGCGA CGGGGCGTGT TCTGATCGCT
GATGCCAATG CCGCATTCCC TGATGTGCCT GATGGGATCT ATCCATCCGA TGCGGCGGCG
GCGCTGCGTC ACGACATTGC CGCGCTGCGC CAGGAGTATA ACTGCGATAT TCAGCGTGAA
CGGAACGGTA CATATGTGCT GGTTTCACCG GGAGACCTGA CGGTGCTCGA TCTGCCAGAC
GAAGAGATCG AAGCAGTGGC GTTCCTGATC GATGCGTATA GCGAGAGCGA TCTTCCGCTT
GCCGGGCAGA TCCGCACCTT TCTGGAGCGC GTAACGCTGC TCATGCCGGA AAAACGTCGC
AACCAGTTGC GTGAAATGAC CGCATCGCCG CGGGTTGATC GTCCGCGTGC ACCGGCGCGT
GGCGTCGATA CGATGATGAA TCTGTTGAAA GGGGTGATCA GGAAGCGTGA AATCGAGTTC
GACTACCGCT CGCCGCACAC CCCCGATGGC GCCGCACTGC ACCATCGGGT CGCCCCGCTC
GAATTCGTCT ATCGCGAGGG ACACACCTAC CTGGATGCGT TCTGTCTTCA GAGCGACGTC
CCGGCGCTCC GCGAGCGATT TGTGCTCTAC CGCCTTGATC GCATCGTTCC CAACAGTGTG
CGGCGTCTGC CGAACGCGCT GCACCGCGAC TACCGGCGAC CGACGTACAC GTTGCGCTAC
TGGCTGGCGC CAGCGGTGGC GCGCACGCGC GATGTGGCGC ACTGGTTTCC GAAAAGTGAG
ATCGCCTACG CCGACGATGG TTCGGCGGAG GTGACAGCGG TCACGAACGA TCTGTGGCGC
GCTCACCAGA TATTGATGCG CTATCGTGAG CATTGTCGCG TGATCGAACC GGCGCAACTG
GTGGATATGA TGCGCGAAAG CGTACAACGC ATGGTTGCGC TCTACGCAAC CGATGCTGAA
CAGCCTGGAG GAGAACGTGA AGTTGAATCG TTTGGGTGA
 
Protein sequence
MGEGSRRGGN KRSSQLIHRR RLFLVQRLIR GSATGRVLIA DANAAFPDVP DGIYPSDAAA 
ALRHDIAALR QEYNCDIQRE RNGTYVLVSP GDLTVLDLPD EEIEAVAFLI DAYSESDLPL
AGQIRTFLER VTLLMPEKRR NQLREMTASP RVDRPRAPAR GVDTMMNLLK GVIRKREIEF
DYRSPHTPDG AALHHRVAPL EFVYREGHTY LDAFCLQSDV PALRERFVLY RLDRIVPNSV
RRLPNALHRD YRRPTYTLRY WLAPAVARTR DVAHWFPKSE IAYADDGSAE VTAVTNDLWR
AHQILMRYRE HCRVIEPAQL VDMMRESVQR MVALYATDAE QPGGEREVES FG