Gene RoseRS_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3987 
Symbol 
ID5210970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4988555 
End bp4989715 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content60% 
IMG OID640597578 
Producthypothetical protein 
Protein accessionYP_001278284 
Protein GI148658079 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCCG TCAAAATACC CGTCGCCTTA CTGATCTTGA TCGGCGGCAG GCAGACGCCA 
AATGTGCTCA GCGCCCAGTT CCTGCGCCCC GACATTATTG CGCCGATTGC TTCACGCGAG
GCGATGCGTC CAGGCGAAGC ATGGGAGAAG GTCAGGCGCG TCCTTGAGCA ACTCAGTCCA
CGGGTGCTTG ACCCGCACAC CGTCGATGCG TTCGATCTGA ACGATATTCG CGAGCAATGC
GCTGCGGCGA TGGATCGCTT TCCCGATGTC CGTTGGGTGT GCAATATCAC CTGCGCCACC
ACGATCATGA GCATTGGCGC ATATGAGGTG GGACGGATGC GCAACGCCAG CGTCTGGTAT
TTCGACACAG CCGGAAGGCG CGTTGTGACG CTGGCCGGTC AACCGCCGGA CGGCGATCCA
TACCGGCTTT CGGTGGAGAA CTATCTCCAG ATATACAATC GTGCGGCTCA ACCGACGCCA
CCTCCACCGG TGTCGTGGGT GGCACTTGCA CGACAGATGG CGCAGGCGCC TGATGACGCG
ATTGAATTTC GTGAGATACT ACGCCGTGCG AACGCCGACG CCAGATTCAC ACAACCGCGC
CGTCTTGCAG TGCTCTCCCT GACGCCGACA ATGGTTCAGT GGTGCGAACA GGCGCAGGCT
GCCGGTTTCA TCTCCGCCAT ACACCAGCAC TCGAACCATC ACGAGATACT TCAGGCAGAT
GGCGCATTCT GGGATTTCGT CAACGGCGCA TGGCTGGAGA TCTACGCCTG GGATGCAGCG
CAACGCGCCG GTTGCTTCGA TGACTGTTGT CCTGGCATCG AAATACCTGC GCAGGGCGGG
CTTTCGCCGA TGAATCAGAT CGATCTCGCG GCCACCCATG CCGCCTCGCT CCTGATCGCC
GAATGCAAGA CAGAGGCGCG ACCGTTTCGC ACCGAGCATC TCGATCAACT GCGCGCGATC
ACCAGCATGA TTGGTGGCTC ATTCGTGGGC GCATTGTTCA TCACAGCGCG CAGCCAGCAC
AAAGCTGATG CACAGGCGCT CGCTGCCTTC CGTGCGCAGG CACAGGCGCG CCAGATTGTG
GTGGTCACAG GCGATCAACT GAATCAGTTG CCGGATATTT TGACGCGCGA GGCGACCAGG
CCGACATTTC CGCGAGGTTA A
 
Protein sequence
MASVKIPVAL LILIGGRQTP NVLSAQFLRP DIIAPIASRE AMRPGEAWEK VRRVLEQLSP 
RVLDPHTVDA FDLNDIREQC AAAMDRFPDV RWVCNITCAT TIMSIGAYEV GRMRNASVWY
FDTAGRRVVT LAGQPPDGDP YRLSVENYLQ IYNRAAQPTP PPPVSWVALA RQMAQAPDDA
IEFREILRRA NADARFTQPR RLAVLSLTPT MVQWCEQAQA AGFISAIHQH SNHHEILQAD
GAFWDFVNGA WLEIYAWDAA QRAGCFDDCC PGIEIPAQGG LSPMNQIDLA ATHAASLLIA
ECKTEARPFR TEHLDQLRAI TSMIGGSFVG ALFITARSQH KADAQALAAF RAQAQARQIV
VVTGDQLNQL PDILTREATR PTFPRG