Gene RoseRS_3054 details


Gene Information

Locus tag: RoseRS_3054
Symbol:
ID: 5210022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3834991
End bp: 3837036
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 59%
IMG OID: 640596646
Product: hypothetical protein
Protein accession: YP_001277368
Protein GI: 148657163
COG category: [S] Function unknown
COG ID: [COG1306] Uncharacterized conserved protein
TIGRFAM ID:
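
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(3834991, 3837036)
print(length, protein_length(length))  # prints: 2046 681
```

Both values agree with the Gene Length and Protein Length fields of this record.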


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000100223
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00791247
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGATGGA TTCGTGGAAG CGCTCTGGTC GGTGCTGTTC TGATCGTGCT TGCGGTTCTG 
TCCGCATGTG GGCGCCCGAA TGTTCCTCTG ACCGGTTCCG TATTCGATGC ATACACCGGC
AAACCGGTGG TCGCAACGCT GAATGTTGGC GACGTTCAGA TCACCACCGA TGCGAACGGC
GTCTATCAGA TCGACCGCTG GAGTGCGCGT GATGTTCTTC GTGTCGTTGC CAGCGGTTAC
GAACCGTTTG AGGTCAACCT TGCTTCGCTG CCGCAACTGG AACGTCCTGA ACCCCCGGCA
GTCGCTCTCG ACCCGATTAC GATTCGCCCG AATACGTTGA GCGGCGTGAT CACCGACCTG
TATGCACGCA CGCCGGTTGC GGGCGCCGTG GTGCAGGTAT CGGAGGCAAT CAGCGCCACC
ACCGATACTG ACGGGCGGTA TACGCTGACT GGCGTTCCCG AAACGTTCCG TGTCACCATA
ACCGCCGACG GATATGCGCC ACTCGTGAGT GATGTGGCGC GTTCGACATC GTTCGATACA
GCGCTCCGTC CCGATACGCT GCGCGGGACG GTCACCGATG GGTATACCGG TCAGCCAATC
GCCGGGGCGG ATGTGACCCT GGGCGATCTG AAAACGACGA CCGACAGCGA TGGGCAGTAT
ACGCTGCGAG GCGTTCCAGA AAAAGGGACA ATCACCATCA GCGCGAAGGG GTATGCCAGT
CTTGACCAAC CTGTCGAACG CACAACGACG CTCAATGTCG CCCTGCGTCC CGACACGCTT
GCCGGGGTGC TGATCGACGC GCGCACGAAA CAGCCGATCC GCAATGCAAC GGTGATTGCG
ACAGTCAATC TCCAGAGCAG CGATGTGGCG ATGACGCGGA TCGATAACAG TTCTGATGGC
GCGTTCGTTC TGGAGGGCAT CCCCGAACAG GGGTATATTC AGGTGCTGGC GCCAGGGTAT
CGCAAAGCGG TTATCGAACT GCGCCCCGGT AGCGTGCCTT CGACGATCGA ACTCGAACCG
TTCTACTCGA AGGCGCTCTA TGTGACTGCG GCAGTTGCCG CGCGCTGGAA CCTGTTGACG
AAGTATTTCG ACATTATCGA CGCGACGGAA CTGAACGCAA TTGTGATCGA TGTCAAGTCG
GATCTGCGCG ATGACCTGGG ATTGGTGTAC TACGACTCGC AGGCGCCGAT GGTGCGGGAG
TTGGGGACGT TCAAGCCCTA CATGGACCTG CGTAAAATCC TTGATGAAGC GAAACGTCGC
GGCATCTACA CCATTGCGCG CGTCCACATG TTCAGTCACG ACAATGTGCT CGCCGAGGCG
AAACCGGAGT GGGCGGCAAA AGATCGGACC ACCGGCGGAA TCTTCTATGA TTATCCGGCG
CCGGGCATCC GCTACGCCTG GCTCGACCCA TGGAACGAGA ACGTCTGGGA GTACAACATT
CAACTGTCGG TCGAGGCGGC ACTGCTCGGA TTCGATGAGA TCCAGTACGA CTACATTCGC
TTCCCGTCGC TGGAGTTTTC TCCCACCGAT AAAGATCGGT TGCTGCTGTC GCGCGAAGGA
ACTCCCGAAG AGCGCTGGGC GAATATCACC GAGGTGCTCA GGCGCTCACA TCGCGCCATC
AACGGCGCCG GCGCCTTCTT TTCGGTTGAT GTATTCGGCT ACACCTCTTT TGGTCCGTCG
AAATTGCTGG GGCAGAACCT GGGTATGATG GCTGAGTACA CCGACTATAT CAGCCCGATG
GTGTATCCTT CGCACTTCAG TCCTGGTGAA TTCGGCTTCG ACAATCCGGC GAAGTACCCC
TATGAGGTCA TCCAGAAATC CATGGCTGCG GCGCTCAGGC AGGTCGAAGG GAAGCGCGCG
CTGCTTCGCC CCTGGTTGCA GGATTTTACG CTGATCTGGG TGCCAAAGGA GTTGATCGTC
GAGTATACGC CGAAGGAAGT TCGGGCGCAA ATCCGTGCTG TCGAGGAGTT CGACGCCAGC
GCCGGCTGGA TCCTGTACGA CTCGACCAAC GTCTATCACG TGGAAGCGTT GAAGCCGGCG
GAGTAG
 
Protein sequence
MRWIRGSALV GAVLIVLAVL SACGRPNVPL TGSVFDAYTG KPVVATLNVG DVQITTDANG 
VYQIDRWSAR DVLRVVASGY EPFEVNLASL PQLERPEPPA VALDPITIRP NTLSGVITDL
YARTPVAGAV VQVSEAISAT TDTDGRYTLT GVPETFRVTI TADGYAPLVS DVARSTSFDT
ALRPDTLRGT VTDGYTGQPI AGADVTLGDL KTTTDSDGQY TLRGVPEKGT ITISAKGYAS
LDQPVERTTT LNVALRPDTL AGVLIDARTK QPIRNATVIA TVNLQSSDVA MTRIDNSSDG
AFVLEGIPEQ GYIQVLAPGY RKAVIELRPG SVPSTIELEP FYSKALYVTA AVAARWNLLT
KYFDIIDATE LNAIVIDVKS DLRDDLGLVY YDSQAPMVRE LGTFKPYMDL RKILDEAKRR
GIYTIARVHM FSHDNVLAEA KPEWAAKDRT TGGIFYDYPA PGIRYAWLDP WNENVWEYNI
QLSVEAALLG FDEIQYDYIR FPSLEFSPTD KDRLLLSREG TPEERWANIT EVLRRSHRAI
NGAGAFFSVD VFGYTSFGPS KLLGQNLGMM AEYTDYISPM VYPSHFSPGE FGFDNPAKYP
YEVIQKSMAA ALRQVEGKRA LLRPWLQDFT LIWVPKELIV EYTPKEVRAQ IRAVEEFDAS
AGWILYDSTN VYHVEALKPA E
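
The protein sequence can be reproduced from the gene sequence using translation table 11, as listed in the Gene Information section. The following is a minimal sketch, not the pipeline that generated this record. It builds the codon table from the NCBI-style amino acid string (bases in T, C, A, G order), whose codon assignments are the same for tables 1 and 11. The function name `translate` is hypothetical. The sketch also assumes the sequence is given 5' to 3' on the coding strand, and it does not handle the alternative start codons that table 11 permits; that does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, in its 10-base blocks.
first_line = "ATGCGATGGA TTCGTGGAAG CGCTCTGGTC GGTGCTGTTC TGATCGTGCT TGCGGTTCTG"
print(translate(first_line))  # prints: MRWIRGSALVGAVLIVLAVL
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should give the complete protein, with the final TAG read as the stop codon.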