Gene RoseRS_3909 details

Gene Information

Locus tag: RoseRS_3909
Symbol:
ID: 5210892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4892228
End bp: 4894321
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 60%
IMG OID: 640597505
Product: hypothetical protein
Protein accession: YP_001278212
Protein GI: 148658007
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
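
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4892228
end_bp = 4894321
gene_length_bp = 2094
protein_length_aa = 697

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 2094 bp is 698 codons, one of which is the stop codon.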


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTATACT ACAATAATAT CGACATGAAA TCATATGTTG ACTTCGAAAT TGCCCTTACT 
CCGGCGGAAA GTGGTCGTTT CTTCCTGAAA GCGCGCGGTC CGCGCGGCGA GGAAGGCGAC
GGCGAATTAC TGCTGCCCGA CGCCGATCCG CAGATCCAGG CGTTATTGAC ACGTTTGCGT
CTGCTGGATG TCGATGAGGC GACGCTCGTG CAGTTGGGGC GTGCGCTGTT CGACGCGCTC
TTCAACGGGG CAGTGCGTGA TGTCTATGTG CGCTGCCGCG GTGCACTCGC CAGCGATGAA
GGGCTGCGTC TGCGGCTCAA TATCCCTCCT TCGGCTGTAT CAGCAGCGCT CCTCCCATGG
GAATTCCTCT ACGACCCGGA TCGCGGACCG TTGAGTCTGC TTGATGCGCC GGTCGTGCGT
CATCTGCCGC AACCGGATCG CATCCCGCCA CTGACGGCGC CATTGCCGCT GCGTGTGTTG
CTGACCGCCG CGCAGACGCC GCCGCCGACT GCGGTCGAAC GTGAATTGAC GGCAGTCCAG
TCGGCGCTCG AACGCTTCGG CGATCAGGTG GCAACAACAG TCGAACCGCA TTTGACTGCG
ACCATTTTGC AGAACCGTTT GCGCGAAGGG TACCATATCT GGCATTTCGT CGGTCATGGC
GGCTTTACAT CCGACGGCGC CACTGCGTGT CTGCTGTTCG AGGACGACCT CGGCGATCCG
GAACCGGTCA GCGCGCTGCA ACTCGGCATT ATGCTCGATC GCAGCAACCT GCGGTTGGTC
GTGCTGGACG CCTGCTCAAC CGGGCAACTG GCAACCGACC CGATGCGCAG TATTGCCCCA
GCCCTGATAC GCGCCCAGAT TCCGGCGGTC GTCGCCATGC AATTTCAGGT CCCGGAAGAA
GCCACCTGCG CATTTGCCAG AGCATTCTAC CACGCGCTGG CGGACGCTTT CCCGATCGAT
TTTTGCGTTA CCGAAGGGCG GCGTGCGGTG ATGAACATCA CCGGTCTTGG GCGCGCCGAC
TGGGGTATTC CGGTGATCTA TACCCGCACT GAAGATGGGC GTCTGTTCAA CCCGCCCACT
GCGCCGCCAC CATCTTCGGT GGCGGCGGTA ACGGCTCGTT CCGTCGGAAC CGGCATGCAG
GCGCTGGAGC ATCTCATTCA GGCTGGCAGC GATGTGCGCG AAGCGGTGAT CGCGTTTCGC
GCCGATCTCC ATGCGGTCGC GCGCCAGATC GACATGCTGG CGGATTACAA AGATGTGCAC
GATCAACTCC ATTCGCTTCA GTTCCACTGC TATAACCCGA TTGTCATCGA TATGCGTCGT
CTGCCCGACG ACGACCTGGC ATGGGAGAGC ATCGCCAATT ACGATGTGAC CCTGCAAAGC
ATCGTGCGCG ACCTGGATCA GATCGTCAGG CGCGGCAGAT TGCCGGCAAG CGAATTATCG
TGGGTGAACG ACGTGCAAAG CGCACAGGTG GACATTGCTG GATCAATCGA TGCTGGCGAT
CTGAAACTGT TGAAAAAGGC GGTTCGTCTG CTCAACCGGG CGCTGACTAC TCAACCATCA
CTGATCAATG CCCGACTCAG CACCACAGCG CGTACCCTGC GACTTGCTTC GCTCGTCGAG
GGTATGACTG CGGTGCGCGA CCGGCTCGCC AGCACGGGGT ACGACGCCGC CAGAGTCAGC
CAGATTGAAC TTGGCGTTAC GGCGCTTGCC ACGCTCAGCG CCAGCCTGGC CAGTCTGGTC
GATGATCATG ATCGCTGGCA GATCATCGAC CTCGAACTGC GGCGCATCGA GCAGTTCATC
GATCAGGACA TCACCGAACT GGAACTCTCG TGGCCCGAGG TTCACCAGCG AGTGGCGCCA
TTCTACCTGG AGAACGCTGA GCCATGGGGC GCTGCATTGA AGAACGACGC TGACAAACTG
ACCAGCGCGC TGGAAGCAGC CGATCCGACG CGCGTGCGGC AGTTCTTTCG CCGTTTTCGC
CGGCAGGCGG GTGAACGCTT TTTCCGCGTC GATATTGAAC TTAAGCGCGT GTGTGACGAA
TTACGACTGA TCGGCGAGTC GCTGACGTCT GTCCTTAAGG TACTGACTGT ATGA
 
Protein sequence
MVYYNNIDMK SYVDFEIALT PAESGRFFLK ARGPRGEEGD GELLLPDADP QIQALLTRLR 
LLDVDEATLV QLGRALFDAL FNGAVRDVYV RCRGALASDE GLRLRLNIPP SAVSAALLPW
EFLYDPDRGP LSLLDAPVVR HLPQPDRIPP LTAPLPLRVL LTAAQTPPPT AVERELTAVQ
SALERFGDQV ATTVEPHLTA TILQNRLREG YHIWHFVGHG GFTSDGATAC LLFEDDLGDP
EPVSALQLGI MLDRSNLRLV VLDACSTGQL ATDPMRSIAP ALIRAQIPAV VAMQFQVPEE
ATCAFARAFY HALADAFPID FCVTEGRRAV MNITGLGRAD WGIPVIYTRT EDGRLFNPPT
APPPSSVAAV TARSVGTGMQ ALEHLIQAGS DVREAVIAFR ADLHAVARQI DMLADYKDVH
DQLHSLQFHC YNPIVIDMRR LPDDDLAWES IANYDVTLQS IVRDLDQIVR RGRLPASELS
WVNDVQSAQV DIAGSIDAGD LKLLKKAVRL LNRALTTQPS LINARLSTTA RTLRLASLVE
GMTAVRDRLA STGYDAARVS QIELGVTALA TLSASLASLV DDHDRWQIID LELRRIEQFI
DQDITELELS WPEVHQRVAP FYLENAEPWG AALKNDADKL TSALEAADPT RVRQFFRRFR
RQAGERFFRV DIELKRVCDE LRLIGESLTS VLKVLTV
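
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the two listings relate: it strips the block spacing and translates DNA codon by codon. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (to my knowledge the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGGTATACT ACAATAATAT CGACATGAAA TCATATGTTG ACTTCGAAAT TGCCCTTACT"
last_line = "TTACGACTGA TCGGCGAGTC GCTGACGTCT GTCCTTAAGG TACTGACTGT ATGA"

print(translate(first_line))  # MVYYNNIDMKSYVDFEIALT
print(translate(last_line))   # LRLIGESLTSVLKVLTV
```

Both fragments reproduce the matching ends of the protein listing, and the gene ends with the stop codon TGA. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.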