Gene RoseRS_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1020 
Symbol 
ID5207966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1254083 
End bp1255780 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content63% 
IMG OID640594634 
Producthypothetical protein 
Protein accessionYP_001275379 
Protein GI148655174 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACCGCG CCCAATTGCG CCGCAGCGAG CAGGAATTGT GGCTGAGTGC CGCTCGTGGC 
TTGCACACCT CTGCCGCCTA CGCCCTTGTC CGTGCTGCCC GTCCCAGCGC GCTGCGCCGC
GCCGTTGAGG TCGCAGAGCT TGGCCGTGCC CGCGGTCTCG GCGAGACGCT GGCCCGTGAT
CGCAGCGATC TCTCCATCAT CGAACGCCAA TACCCGGACA TCTACGAACG CTACCGTGCG
GCAGCGGAGG TCGTGCGCAG TCTGGAGCGC ACCGACCGCG CCACGACTGA TCGCCAGGAC
GAAGCGCAAC CGTCGTTCAC CGAACTGGCG AACCGCATCC GCGCTGCCCG CGCCGATCTG
GATGCCGCCA TTGAAGCCAT TCGCGCCATT CCCGGCTATG AAGCGTTCCT CCGGCCTCCC
ACCTACGCTG AGATCGCGGC AGCCGCCCAA CCCGGCGTGC CACTGGTCTA CCTGATCACC
ACTCCCCAGG GCAGTCTGGC GCTGATCGTT CCCGATGGCA ATGCAGAACC TGAAGCGCTG
CTGCTCGACG AATTTACGGA GAACGATCTG AAAAACCTCC TGCTCGTGCG CACAGGCGAT
GAAGTGGTAG GTGGGTATCT GCCGGGTCAG TTGCTCGGTG GTAAGATGCT GGAAACGGCG
CTGGCAATGG CGCTGCCGAT CCTTGGCGAG CGGCTGATCG CGCCGCTGGC ACAACGGCTC
CGCGCCTTGA ATGCTACCGG CGTGACGCTC GTCCCCACCG GTCTGCTCAG CCTGTTGCCG
CTGCATGCGG CAACCTACCG AATTGATGGC TCCTCCCGCA GCCTGCTCGA TGAGTTTGAT
GTCGCTTATG CGCCGTCGGC GCGGGTGCTG GCAATTGCGC AACGCGAACA GCAGCGGCGG
GCAGCAAAGG GGGTGCGGCT GGCTGGTGTT GGGAATCCTA CCGGCGATCT GCGTTATGCC
AGTGCAGAGT TGCACAGTAT CTGCGATCTG CTGCCACCGG CGGCGACGAC AACGTTCTAT
GAGCAGGCTG CGACTCGTAG CGCAATCTGG TCCGCTCTCG GTGAGAGCAC CATTGGTCAC
TTTTCCTGCC ATGGCAGTTT CGCCGACGAT CCACTCGATT CGGCCTTGCA CCTGGCGGCA
GGTGATCGTA TTACGCTGCG CGACCTGGTG GCAGGCGACA CCACAGCACT GAGTAATCTA
CGTCTGGTGG CGCTCTCGGC CTGCCAGACT GCGATCACCG ACTTCGGGCG CCTGCCCGAC
GAGAGCATCG GTCTGCCGGG CGGCTTTCTG CAAGCCGGTG TGCCCGCCGT CGTCGGCACG
CTCTGGAGTG TCAACGACCT CAGCACGGCG CTGCTGATGC ACCGCTTCTA CGAACTGCAC
CTGCACGGCG ATGACGCGGC AGGGCTGGCG CCGCAACCGC CGGTGCGGGC ATTGCGCCTG
GCGCAACAGT GGCTGCGCGA TCTGACCTAC AAAGAGATGT TTGACTATTT TCAGCGGCAT
CGCCAGCTCA AAGCGGTGCG GCAGCATAAC TCGGCATCGG TGCAGGTTTC AGGTGTGCGG
ATGCCGTCTG CTCTGATCGA AGTGGGGCGC GCCCTGGCTG AGGAATATAT GCTCGATCAT
CCGAACAACC GTCCATACGC CAACCCAATA TGTTGGGCCG CCTTTACGTT CAACGGTGCA
ATGGAAGGAG CTGCATAA
 
Protein sequence
MHRAQLRRSE QELWLSAARG LHTSAAYALV RAARPSALRR AVEVAELGRA RGLGETLARD 
RSDLSIIERQ YPDIYERYRA AAEVVRSLER TDRATTDRQD EAQPSFTELA NRIRAARADL
DAAIEAIRAI PGYEAFLRPP TYAEIAAAAQ PGVPLVYLIT TPQGSLALIV PDGNAEPEAL
LLDEFTENDL KNLLLVRTGD EVVGGYLPGQ LLGGKMLETA LAMALPILGE RLIAPLAQRL
RALNATGVTL VPTGLLSLLP LHAATYRIDG SSRSLLDEFD VAYAPSARVL AIAQREQQRR
AAKGVRLAGV GNPTGDLRYA SAELHSICDL LPPAATTTFY EQAATRSAIW SALGESTIGH
FSCHGSFADD PLDSALHLAA GDRITLRDLV AGDTTALSNL RLVALSACQT AITDFGRLPD
ESIGLPGGFL QAGVPAVVGT LWSVNDLSTA LLMHRFYELH LHGDDAAGLA PQPPVRALRL
AQQWLRDLTY KEMFDYFQRH RQLKAVRQHN SASVQVSGVR MPSALIEVGR ALAEEYMLDH
PNNRPYANPI CWAAFTFNGA MEGAA