Gene RoseRS_4030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4030 
Symbol 
ID5211013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5048235 
End bp5050298 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content60% 
IMG OID640597619 
Producthypothetical protein 
Protein accessionYP_001278325 
Protein GI148658120 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACATCC CGTTATTGAT CCATGTTGGC GGAAGTGTCG TCAGTTTTTG GGTCGGTCTG 
TTTCTGGGTA ATCGCACCGC GTATCATCCC TCGACGCGAT GGTTGCGCCT GACGTTTGTC
CTCACCGGTT GCTACTTCCT CCTCAGAGCG ATTGAATCCT TTCCCGATCA TCATGCCCAA
CTCCTCGCAC AAATACAACT GGCGCTGGCG CTGGTTGCGC TGGCATGCTG GTACGGCTGG
GTGATCGGCG TGCGCGGCAT AACGCGTGCA GAGCGTCGGG CTCGCTGGAT CTCTATTGCT
CTGGCGGTCG GATGTGCGCT GATCGCTGCG CTCACCCTTG TGCCATCTGT GATCATCATC
GGCGGGTACC TCGCCGTCAC GCTGGCGAGC GTCTGGTTCA GGACCTGGCG ACTGTACCGA
CGCGAAACGA TCCTGCCAGC GCAGCGGGCA GCCGGAACGT TGCTCGTTGC GACCACAATC
ATCTGCATCA GTGTGGTGAC GCTGGCAGCA GTCGCGTGGC TGCAATCAAC GCCGTCGGAG
ATCGCCGGGA TGTTGGTGCA GGCAGGTGAA GGGATGCTCT GTGGCGGCAT TGCGCTGCTG
GGGTATGGGG CGCTCATCTA CACGCGTCGG CACGCAGGGC GCAACAGCGG TCGTGATTAT
CTGGTCAGCG GACTTGCCAG CCTGGGTATT GTGACAGTGT ACGAGCTTAT CCTGATTGCC
TTTCACTACG TAACCCGGAG CGTGTTGACA GCGGAACAGG CGCTGGCGTT AGGGATCATC
CTTGTGCCGG TGCTCATTGC CACCCACTTC GGGTTCGATC AACTGCGGGA TGCGCTCGAC
TGGTTGCGTT TCGATCCGTC GTCGCGCCTG CTGCGCAGTA TGATGCGTTC CTTCACCAGA
CAGATCGGCA CGGAAAAACC GTGCGATCAG GTGGTGCGGG AGTGTCTGCA AACGCTCGCC
TGTGTCGCCG ACGTATCGCG CAGCGCGTTG TTCTGGTTCG AAGAGGACGA AGCGCGGTTG
CTTGCATCAT ACGGGCATAC ACCGTCATAT GCGCCCAAAC AGAGCGATCT TCTCGCCGTG
CGGATGCAAC CGTTCGATCA ATGCGCTGAC TACACCTATC TGCTGCCGCT CCGTGTGGCT
CGAAAGCAGC GTGGCGCGCT GTTGCTCAGC AGTCCCGGTT ATGGGCAATG GTCGCCCCGC
GAGCGCGAGC AACTCAGTAC ACTCGGCATG TTGCTGGCGG CATATATCGA TCACTCCCAC
TCCGAGCCGG TGACCATCCC GCAGCGCATT CACCGCCTCG AGGAGCAGGC GCGCGATGTG
CAGCAACTGC ACGCCGCATT GAGCGATGTG CAACCGCCCC TGGTGGTCAT AACAACGCTG
GGACGCTTCG AGGTTCAGGT CAACGGAACA TCGGCGCAGT ACCGGCGTAT GCGCATTGGG
CGGCATATGT TCCAGGGGAT GTTGATGTAT CTCGTCGCTC ACACAGGGCG TCCCGTTCGG
CGTGATGTGC TGGTCGAGGT GGCGCTGGAG CATCGCCGTG GACGCAAACA CGAAGATGAT
TTGCCGGACG AATCGCACTA CATCTCCGGA TTGCGCACAA CCTTGCAGCA TTGGGGGATG
GGCGATGCAC TGGAGGTGAC TGCCGAAACT GTCACGCTGA AACGCCATCC GTCCTGGACG
ACCGATACCG ATCAGGTGCT GGCGTTGAGC AACAGCGCCC AACAGGATAT CGCGCAGGGT
CGGATTGAGG CGGCTGTGGC GGCGCTCGAA GAAGCGCTCA GTTTCTTCCA CGGCGATTAT
TTGCCGCAGT ACGACGCCCC TGACTATCGC ATTGATCACG AACAACGAAG GTGGGAGCGT
GAACGGGTGC AGATCGAGAA GTTCCTGCTC AGGTCTTACC TGCGCCTGCC AGACCTCTCC
GCCAGGCAGC ATGCTCCAAA TATTGCAGAC GCTATTCTGA GTCGGAATGA AGATGATCCA
GAGATGGTGC GCCTGGTACT GGATGTTGCG CAGCGTCTTC AGGACAATCA TCTGCTTCAG
CGCTGCCGAT CCGTCCAGGG ATGA
 
Protein sequence
MDIPLLIHVG GSVVSFWVGL FLGNRTAYHP STRWLRLTFV LTGCYFLLRA IESFPDHHAQ 
LLAQIQLALA LVALACWYGW VIGVRGITRA ERRARWISIA LAVGCALIAA LTLVPSVIII
GGYLAVTLAS VWFRTWRLYR RETILPAQRA AGTLLVATTI ICISVVTLAA VAWLQSTPSE
IAGMLVQAGE GMLCGGIALL GYGALIYTRR HAGRNSGRDY LVSGLASLGI VTVYELILIA
FHYVTRSVLT AEQALALGII LVPVLIATHF GFDQLRDALD WLRFDPSSRL LRSMMRSFTR
QIGTEKPCDQ VVRECLQTLA CVADVSRSAL FWFEEDEARL LASYGHTPSY APKQSDLLAV
RMQPFDQCAD YTYLLPLRVA RKQRGALLLS SPGYGQWSPR EREQLSTLGM LLAAYIDHSH
SEPVTIPQRI HRLEEQARDV QQLHAALSDV QPPLVVITTL GRFEVQVNGT SAQYRRMRIG
RHMFQGMLMY LVAHTGRPVR RDVLVEVALE HRRGRKHEDD LPDESHYISG LRTTLQHWGM
GDALEVTAET VTLKRHPSWT TDTDQVLALS NSAQQDIAQG RIEAAVAALE EALSFFHGDY
LPQYDAPDYR IDHEQRRWER ERVQIEKFLL RSYLRLPDLS ARQHAPNIAD AILSRNEDDP
EMVRLVLDVA QRLQDNHLLQ RCRSVQG