Gene RoseRS_2919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2919 
Symbol 
ID5209888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3649363 
End bp3652605 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content67% 
IMG OID640596515 
Producthypothetical protein 
Protein accessionYP_001277237 
Protein GI148657032 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATCA GACGACATCC TTCCAGTATT GCACTCACCC TCGCGCTGCC TGTCGCGCTG 
GTGTTCGGCG CGCTCAACCC GGTGGCGGCG CAGCAGCCCG CGCCGACGGC TACAGCACTC
CCTGCAACCC CCGTGCCAAC TGTGGCGCCA CCCACGCCGA CCGCAGCGCC GCCCGCGCCG
ACCGTGGCGC CGCCCGTGCC GACCGTGGCG CTGCCGACCC CTGAGCCGAC GCTCACCGTG
CCGGAGGCCA TTGACCGGAT TCTCAGCGGC GAGAAGCCGG TTGAGACCCT CGTCACCCTG
TTCGCCTGGC AGCCGGCGCT GCTTGTGGTC ATCATGGCGC TGGTTGTCGC TGTTTCCGTT
GTGAAACCCT GGCGCGAGCG TCTCTTGCGC CGGGTTGATC GTCTCATCGG CGGGGTACGG
AGCGATGTTG AAGATGAGGT GCAGCGTGAG GAAGAGAAGA AGAGGCTCGA AGCCGAACGG
CAAGCTCGAC AATTCCAGGA GCAGTTGCAG GTCGGCATTA CGCACTATCT CGACTGGCTC
CAGGCGGAGT ACGGTTTTAC GCAGCCGCTC GGTATCGCCA CCGAGCAGGT GCAACTCAGT
CTCGAATCGG TTCACGTGCC GCTGCGGGTT GTCGAACGCG GCGCGATTGA GGCCCACCGC
CGGCGCATGC GCGGCGAAGA GCGGCAAGGG CCGGAGCGGG AGTTGCCCGC CGGGGAGCGT
CGCAGCAGGT ATGTGTTTGA ACTGCTGAGT GAGCCGGAGT TGCTGGCAGC GCGCCAGACG
CCTCCGACCA GAGGGGTATC AGGCGATGAT GAATCTCCGT CGCCGGTGAC TACCACGCGC
CTGTTGCTGC TTGGCGATGC GGGAAGCGGG AAAACAACGA CCCTGCGCTA TGCCGCGCTT
CGCCTGGCTG AGGCGTATCG CCGGGGCGAT GCTGCGTTGC TGGCGAGCGA TGCCGCCGGT
TTGCATCTCC ATCTGCAGCG GGCGCCGTTG CCCATCTATG TGCGCCTGAC GCTTTTTGCC
GCGTCGATCC CGGCCGATCT GCGCGAACTG CCGCCGCAGG AGCGGGAGCG CTACGCTGGC
GCGCCGGCCG ACCTGTTCCT TACGTGGCTG GATCGCGAGG CGGCAAGGCA TTGCGAGATT
CAGGAGGGCG CGCTCTCGTC GCTGATCGGG AAGAACGACG GCAACGTGCT GCTCCTGCTC
GACGGACTGG ACGAGGCGGG CGATGAACAG CGCCGCGCGT ACCTGGCGCA GGTGATTGAC
AATCTCGCGC GCCGGTATGA TAAGCAGCGC TACGTCGTCG CAAGTCGCAC GGCAGGCTAC
GGCGGGCTGG TCTACCTGCC CGACTTCCTG GAGCGGCACC TCAGCCCGCT CGATGAGCAG
GAAGCGCAGG CGCTGCTGCG CAAGTGGTTC GATGCCGTGT ATGCGCGCCT GCACGCGATC
GGGCGGCGGC GACAGGACGC CGCTGCCGAT CAGGCCGCGC AGCTCTGGGA AGTCATTGAG
CGCAATGATC GCCTGCGCGA CATGGCGACG AATCCGCTGT TGCTGACGGT GATGGCGCTG
CTCCAGTTCA ACAGCGTCCG GCTCCCCGAC CAGCGCGCGA AACTGTACGA GAAGTTGATC
GAACTCCTCC TCGACCTCTG GCGCAGGCAG AATGTTGCCA GCGACACGCT GGTGACGAGC
GTTGCGCAGC TTGCGTCCGA GCAACGCCGG CTGGAAGCGC TCGCCCTCGC GATGCAACAA
CAGCCGCAGC AGGTGCGCGA GGTGACCCTC CGCCAGGCGC AGGAATGGCT CAGCCCGCTG
TATGTCGAAC GATTGAAGAT TGACCGCGAA GAAGCCGACA GGCGGGTGCA TGATCTGCTG
CGCCGCCTTG CCGTCGACAG CGGGATCATC CAGCAGCGCG AGGAGCGCTA TGCCTTTTCG
CACTACACGT TTCAGGAGTA TCTGGCGGCG CGAGCGCTCG ACAGCCTCGA CAACCGCGAC
GGCGCGCCGG ACAGCGTGGC GTTTCTTCTG GAGCGCAGCG CAGACGCGCG CTGGCGCGAG
ACCCTGCTGC TCGCCGCCGG CTACTGGAGC AATGGTCAGC AGATCCGTAA GACGGAGCGG
CTCCTGCGGG GATTGCTCGA CAGGCGCGAT CCCGAAAACC TGCTGCTCGC CGCCGCTGCT
CTTGCCGATG TCGGCGTGGT CGAGGACCTC GCCGACCTGC GCGATGAAGC CACCGCCCGC
CTGCGCGCCC TCGCCGCCCT CACGGAGGAC TGGCGCAGCG CCGCCCACCC CGACCCCGCG
CTGCGCAACC GCGCCGCCAC CATGCTCGAC CGGCTGGATG CCGATACTGA GCGTCCGGGG
CTTGACCTGA CGAAGCCCGA CTACTGGGCG AACCGCATCG AGCCGGGGAC GTTCAGCATG
GGTGATACGA ACAGCACATA CGACCGCGAA GAGCCGCAGT TCGACTACAC CATCCGCCGG
CCCTACGCCC TGGCGCGCTT CCCGGTGACT AACCGCCAGT ACCTGCTCTT CGTCGAGGCC
CTGGCCGGGC GCGGCGCGCC CGAAGCCGTC GCGGCGGCGA ATCGGCTGAA GGATCTGATG
AAGCAGCACG GAGAAACCCC GGAAACGTAT AACGGGTTCC GCCCGTACTT CTGGCCCGGC
GCGCGCTACC GGGCCGGCGA GGGCAACCAC CCGGTGGTCG GCGTCACATG GTATGCGGCC
ACGGCCTTCG CCTGGTGGGC CGACGCCTGG CTGCGCGCCC TGGGTGTACT GAAGGAGGGC
GAGGAGGTGC GCCTGCCCAC CGAGGCCGAG TGGGAGCGGG CGGCGGCCTA CCCGCCGACC
CTGCCGGGCA GCGACCCCCG TACCGGGCGG CGCGAGTACC CCTGGGGCGC GGAGTTGACA
ACCGCGACCA GCGGGAGTAT GATTGCCAGC ATTCAGGCTA ACATCGACGA GAGCAAGATC
AGCGGAACCT CGGTGGTGGG CATCTTCCCC CACGGCGCGG CAGCCTGCGG GGCGGAGGAA
CTGGCGGGGA ATGTCTGGGA GTGGTGCAGC ACGCCACCTC TGAAGTATCC GTTCAAAGGC
GAGGTGAGCG CAGAAAGTCT TTACACAAAA AACAAACGTG CTGGTGGAAC ATACGTGCTG
CGCGGCGGCT CGTGGAACAG CCTTCGCGAC GGCGCCCGTT GCGCCTGCCG CAACGTCCTC
AACCCTGGCC ACGTCCTCGT CATCATCGGG TTTCGTCTCG CCCGTTTGTT CTCCTCTTGC
TAA
 
Protein sequence
MHIRRHPSSI ALTLALPVAL VFGALNPVAA QQPAPTATAL PATPVPTVAP PTPTAAPPAP 
TVAPPVPTVA LPTPEPTLTV PEAIDRILSG EKPVETLVTL FAWQPALLVV IMALVVAVSV
VKPWRERLLR RVDRLIGGVR SDVEDEVQRE EEKKRLEAER QARQFQEQLQ VGITHYLDWL
QAEYGFTQPL GIATEQVQLS LESVHVPLRV VERGAIEAHR RRMRGEERQG PERELPAGER
RSRYVFELLS EPELLAARQT PPTRGVSGDD ESPSPVTTTR LLLLGDAGSG KTTTLRYAAL
RLAEAYRRGD AALLASDAAG LHLHLQRAPL PIYVRLTLFA ASIPADLREL PPQERERYAG
APADLFLTWL DREAARHCEI QEGALSSLIG KNDGNVLLLL DGLDEAGDEQ RRAYLAQVID
NLARRYDKQR YVVASRTAGY GGLVYLPDFL ERHLSPLDEQ EAQALLRKWF DAVYARLHAI
GRRRQDAAAD QAAQLWEVIE RNDRLRDMAT NPLLLTVMAL LQFNSVRLPD QRAKLYEKLI
ELLLDLWRRQ NVASDTLVTS VAQLASEQRR LEALALAMQQ QPQQVREVTL RQAQEWLSPL
YVERLKIDRE EADRRVHDLL RRLAVDSGII QQREERYAFS HYTFQEYLAA RALDSLDNRD
GAPDSVAFLL ERSADARWRE TLLLAAGYWS NGQQIRKTER LLRGLLDRRD PENLLLAAAA
LADVGVVEDL ADLRDEATAR LRALAALTED WRSAAHPDPA LRNRAATMLD RLDADTERPG
LDLTKPDYWA NRIEPGTFSM GDTNSTYDRE EPQFDYTIRR PYALARFPVT NRQYLLFVEA
LAGRGAPEAV AAANRLKDLM KQHGETPETY NGFRPYFWPG ARYRAGEGNH PVVGVTWYAA
TAFAWWADAW LRALGVLKEG EEVRLPTEAE WERAAAYPPT LPGSDPRTGR REYPWGAELT
TATSGSMIAS IQANIDESKI SGTSVVGIFP HGAAACGAEE LAGNVWEWCS TPPLKYPFKG
EVSAESLYTK NKRAGGTYVL RGGSWNSLRD GARCACRNVL NPGHVLVIIG FRLARLFSSC