Gene RoseRS_3871 details

Gene Information

Locus tag: RoseRS_3871
Symbol:
ID: 5210853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4841999
End bp: 4845478
Gene Length: 3480 bp
Protein Length: 1159 aa
Translation table: 11
GC content: 63%
IMG OID: 640597466
Product: laminin G, domain-containing 2
Protein accession: YP_001278174
Protein GI: 148657969
COG category:
COG ID:
TIGRFAM ID:
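
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAA, which is consistent with that). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4841999
END_BP = 4845478
GENE_LENGTH_BP = 3480
PROTEIN_LENGTH_AA = 1159


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the stop codon is counted in the length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3480
print(expected_protein_length(GENE_LENGTH_BP))  # 1159
```

Both results agree with the Gene Length and Protein Length fields listed in the record.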


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTCG TAAGGTTGCC AGTTGCCTCA GTCATCGTGC TGGTCGTCGC CATTGCATCC 
CTCATAGGTG TGACGTTGCT GCTGCCGGGC AACGTTGCGG CGCAGCAGGG ATACTCGCTG
CGTTTCTACG GAACCGGCGC TCGCGATGTG GATCGTGTCA AGATTCCGCT GGACGCCCCG
CATCGTTCTG TCGATGTTGG TGCGACTGAT TTTACAGTCG AGTTCTGGAT GCGAGCGCTG
CCGGGCGCCA ATGCCAGCGG TCCCTGCGTT GCCGGACAGG ACACGTGGAT CAACGGCAAC
ATCATGTTCG ACCGTGATAT CTTCGGCACA GGCGATTACG GAGACTATGG CATTTCACTC
TATGGCGGGC GCATCGCGTT CGGCGTTGCG ACATCGACTG CGAGCCGGAC AATCTGTGGT
GCGACGAATG TCGCCGATGG GCAATGGCAT CATATTGCCG TAACACGGCG CTCCAGCGAT
GGACAGTTGA GCATTTTTGT GGATGGAGTA CTTGATGGAC AGGCGCCCGG TCCGGTTGGC
AATATCAGTT ACCGGGATGG ACGCGCCGCG CAATGGCCCA ATGAGCCGTA CCTGGTCATC
GGCGCCGAAA AACATGACTA CGACCGCGTC GCCTATCCAT CATTCAATGG CTGGATCGAC
GAGGTGCGCA TTTCAAATAC GCTGCGATAT ACTGTATCAT TTGCCCGTCC AACGCAACCG
TTTACACCCG ATGCGTTGAC GGCTGCACTC TACCATTTCG ATGAAGGCGC CGGATTGATT
GTCAATGATA CATCCGGCGC GGTTGATGGA CCTGCCAACG GAACTCTCTT TGTTGATCCA
CTGTCAGGCG GTCCTGCATG GTCTGCAGAA ACGCCACCCT GGACAGGAAA TCCACCAACC
AGCACGCCCG CACCGACGGC GACCGATACG CCCGTGCCGA CGGCGACGAA CACGCCCGTG
CCGCCGACGG CGACGAATAC GCCTGTGCCG CCGATGGCGA CGAATACGCC TGTGCCGCCG
ATGGCGACGA ACACGCCCGT GCCGCCAACG GCGACGAACA CGCCCGTGCC GCCGACGGCG
ACCGATACGC CTGTGCCGCC GACGGCGACA AATACGCCCG TGCCGCCAAC AGTGACGAAT
ACGCCCACTG CCACGCCGGT TCCTCCCACG GCAACGCATA CACCGCTGCC AACGGCGACG
AACACGCCCG TGCCGCCGAC GGCGACCAAT ACGCCGCTGC CGACGGCGAC GAACACGCCC
GTGCCGCCGA CGGCGACGAA TACGCCTGTG CCGCCGACGG CGACCAATAC GCCGCTGCCG
ACGGCGACGA ACACGCCGCT GCTGCCGTCA AACAATGCGC TGCGCTTCGA CGGGGCGAAT
GATGAGGTGC GCGGCGGACT GCTCGCCGGA CTGGGCGGCG TGCAGACCAT CGAACTCTGG
GTGCGTCCGG CGACCGGCGG GCAGGATAGC GTCATCATCG CCCATGGCGA CGACGATTCT
GGTTGGGCGC TGGAACTGAA CGGCGGTCGC GCCACCTGGT GGGTCGCTTC GACCGCCGGC
TGGCGCGCGG CGCAACACCC GACGGCGCTG CTCGCCAATA CCTGGTACCA CATCGCCGTG
ACCTACGACG GCGCAACCGC GCGGGTGTTC GTTAATGGTT CATCCGGGTC GGCGGTGACC
ATCGGCGCGA TCACGCAGGG ACCGTTCCTG CGTATCGGTG GGCTGGCAGG GTACGGCTTC
TTCAACGGCG ACATCGATGA CGTGCGCATC TCGAATGTCG TGCGCTACAC CAGCACGTTT
ACACCGCCTT CAACAGCACA TCCGGCGGAT GCGAACACGC GCGCGCTTTA CCGGCTTGAT
GAAGGGAGCG GGCAAACCAC AGCAGATGCT TCGGGGAACG GGTATCACCT GACCCTCGGC
ACAACGGTGA ATGCTGACAG CGCCGATCCG ACGTGGGTGG CGTCAACCGC ACCGATTGCG
CCGCCGCCGA CGGCGACCAA TACGCCACTG CCGACGGCGA CCAATACGCC CGTGCCGCCG
ACGGCGACCA ACACGCCTGT GCCGCCGACG GCGACCAATA CGTCGCTGCC GACGGCGACC
AATACGCCCG TGCCGCCGAC GGCGACCAAC ACGCCTGTGC CGCCGACGGC GACCAATACG
CCCGTGCTGC CGACGGCGAC CAGTACGCCA CTGCCGACGG CGACCAACAC GCCTGTGCCG
CCGACGCCCA CGCTAACACC CACAGGCGAA GGTCCGGCAG AAAATCTGTT GCGGAACGGC
GGATTCGAAC TCGACGCCAA CGGCGATACA CGCCCCGATA ACTGGACGTC GAATACGCGC
GTGACCAGAA GCGCAGCAGT GGTGCGTAGC GGCAGTTATG CCATGCGGCA CTACGCGACT
GATAACGCAA ACTATACTAT TTCGCAGACC GTTGCGGGCG TAACTGCGGG GACGAACTAC
GTCCTTGTCG GTTACGTCAC CATTCCCCCC ACCAGTGACA CGTTCACGTT CAATGTGCGT
GTGCGCTGGC TGCGGTCCGA CAGTACGATC ATTCGCACCG ATACAGCGCG CAGTTTCACT
GGCAGCATCA GTGGATGGGA GATGGCACGC GGCGTCTATA CTGCGCCAGC TGGCGCCGTG
CAGGCGCGGG TCGAAATGAA TGTGTCCAGC CTGAATGCAA CGGTGTACGC TGACGATTTT
GCCTTCGGTC CTGTGTCGCC AACTGCAACG CCAACGAGCA CGTCCGTTCC GCCAACGGCG
ACGAACACTC CCGTGCCGCC AACGGCGACA CAGACCCCGG CATCACCGAC TGCAACCTCC
ACACCGTCGC CCACTGCCAG CGCCAACGGT GCACTGGCGT TTGACGGCGT CGATGACGAA
GCGGGCAACA CGGCATTTTC GATGAGCGGC GGATTTACGG TCGAAGCCTG GGTGCGTCCT
TCGAGCGGCA ATCAGAACTC GATTGTGATC GTCACCGGTG ATGGGTCACG CGGCTGGTCA
CTGGAACTCA ACGATGGTCG TGCAACACTG TGGGTTGCAA ACAACGCAGG CGCCTGGTCG
TTTGTTCGCA ATGATAGTGT CGTGTTGCAA GCCAATCAGT GGTACCACAT TGCAGCAACG
TATGAAAACG GCAATGCGCG CGTGTTTGTC AACGGCGTTG CGGGAACATC AGGGACGGTT
GGCGCAGTGA GTCACATGCC GGTCCTGCGC CTGGGCGGAT TGACTAGCTA CGGCTTCTTT
GCCGGTCAGA TCGACGACGT CCGCATCTCG CGGATCGTGC GGTACACGGG CAGCTTCACA
CCACCATCGA TGCCGCTGCC AGCGGATGCC AATACTATCG CGCTCTACCT CTTCGATGAA
GGGAGCGGGC AGACGGCGTA TGATGCGTCG GGGAACGGGT ATCATCTGAC GCTTGGACGA
AGCGCTGGCG TGGACAGCGC CGATCCGCAG CGCGTTGTAT CGAGTGCGCC GGGCAGATAA
 
Protein sequence
MRFVRLPVAS VIVLVVAIAS LIGVTLLLPG NVAAQQGYSL RFYGTGARDV DRVKIPLDAP 
HRSVDVGATD FTVEFWMRAL PGANASGPCV AGQDTWINGN IMFDRDIFGT GDYGDYGISL
YGGRIAFGVA TSTASRTICG ATNVADGQWH HIAVTRRSSD GQLSIFVDGV LDGQAPGPVG
NISYRDGRAA QWPNEPYLVI GAEKHDYDRV AYPSFNGWID EVRISNTLRY TVSFARPTQP
FTPDALTAAL YHFDEGAGLI VNDTSGAVDG PANGTLFVDP LSGGPAWSAE TPPWTGNPPT
STPAPTATDT PVPTATNTPV PPTATNTPVP PMATNTPVPP MATNTPVPPT ATNTPVPPTA
TDTPVPPTAT NTPVPPTVTN TPTATPVPPT ATHTPLPTAT NTPVPPTATN TPLPTATNTP
VPPTATNTPV PPTATNTPLP TATNTPLLPS NNALRFDGAN DEVRGGLLAG LGGVQTIELW
VRPATGGQDS VIIAHGDDDS GWALELNGGR ATWWVASTAG WRAAQHPTAL LANTWYHIAV
TYDGATARVF VNGSSGSAVT IGAITQGPFL RIGGLAGYGF FNGDIDDVRI SNVVRYTSTF
TPPSTAHPAD ANTRALYRLD EGSGQTTADA SGNGYHLTLG TTVNADSADP TWVASTAPIA
PPPTATNTPL PTATNTPVPP TATNTPVPPT ATNTSLPTAT NTPVPPTATN TPVPPTATNT
PVLPTATSTP LPTATNTPVP PTPTLTPTGE GPAENLLRNG GFELDANGDT RPDNWTSNTR
VTRSAAVVRS GSYAMRHYAT DNANYTISQT VAGVTAGTNY VLVGYVTIPP TSDTFTFNVR
VRWLRSDSTI IRTDTARSFT GSISGWEMAR GVYTAPAGAV QARVEMNVSS LNATVYADDF
AFGPVSPTAT PTSTSVPPTA TNTPVPPTAT QTPASPTATS TPSPTASANG ALAFDGVDDE
AGNTAFSMSG GFTVEAWVRP SSGNQNSIVI VTGDGSRGWS LELNDGRATL WVANNAGAWS
FVRNDSVVLQ ANQWYHIAAT YENGNARVFV NGVAGTSGTV GAVSHMPVLR LGGLTSYGFF
AGQIDDVRIS RIVRYTGSFT PPSMPLPADA NTIALYLFDE GSGQTAYDAS GNGYHLTLGR
SAGVDSADPQ RVVSSAPGR
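
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be joined into a single string and its GC percentage computed, for comparison with the GC content field in the record, is given below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as a small demonstration.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from the record, used as a small demo.
first_line = "ATGCGCTTCG TAAGGTTGCC AGTTGCCTCA GTCATCGTGC TGGTCGTCGC CATTGCATCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be checked against the listed GC content.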