Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2190 |
Symbol | |
ID | 5209153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 2694483 |
End bp | 2700491 |
Gene Length | 6009 bp |
Protein Length | 2002 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640595792 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001276520 |
Protein GI | 148656315 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.753636 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCGC TCTACCGCCG GTTTGCGATC CTGTTGATAC TTGCGGTGAC TCTCGCCGCA TGCGGCGGAC CGCGCCCGCA ACCAACCCCA ACCCCGGTGA CTGCGCAACC CACTCCGGTT GCCCCGCAAC GGGATATTCC CCCACCGCCG GATCGCGCGG CGCCGATCCT GGTCGCCCGC TCGCCCGAAC CGGGACAGGC GCTCGATCCG GGCGCGCCGA TTGAACTGGT CTTCGACCGT CCGATGGATC GCGCGTCGGT TGCGGCGGCG CTGAATATCG CCGGGGTGAC CGGCGCGATT GAATGGCGCG ATGCGCGCAC GGTGCGCTTC GTTCCGTCCG CGCCACTGCA ACGCGCTTCC ACGTATGAGG TGTTCCTGCG CGAAACGGCG AAAGGCGCGG ACGGCTTGCC ACTCGCGGCG CCGGTGCGCT TCCGCTTCGC CACTGCTGGA TTCCTCGAGG TCGGGCAGGT GATCCCTGCT GACGGCGCGG CTGATGTGCA GCCCAATTCG ACCATCACGG TCTTCTTCAA CCGTCCGGTT GTGCCGCTCA CGGCGATCGA GTCGCAGGCG AACCTGCCGC AACCGGTGAC ATTCGACCCG CCGATCGCGG GGCGCGGGGA ATGGCTGAAT ACCGCCATTT ATACGTTCGT CCCGGCAGCG CCGCTTGCCG GCGGCGCGAC CTACACCGGA CGCATCGCCG CAGGTCTGAC CGATGTGACC GGCAATCCGC TCCAGTCGGA GTACACCTGG CGTTTCACCG TCGCCCGTCC GCAGGTGGTG ATGATTGACC CATCCGATGG CGCAACCCTT GTGCCACTGC AACCACGCAT CACGTTGCGC TTCAATGTGC CGGTCGATCC TGCATCGGCG CGCGCCGCAT TTCGTCTGAG TCAGCCGGAT GGCGCCCCTA TTCCTGGCGA TCTCCAGATT GACGGTGAGA CGCTGGTGTT CACGCCGTCG CAGCGGCTGG ACTTCGAGAC GCGCTACACG GTCGAAGTGG CTGCCGGTCT GACGGGTGTT TCCGGCGGTC TCGGCATGGC AAACGATTTT CGCGCAACCC TGCAAACGGT TCCACGATTG CGCATTCTAG AGACCGATCC GCGCGATGGC GAGACGGATG CGCGGCGCGG CGGTTTGACG ATCCGGTTCA ATGCGCCGGT CGATCCGGCG ACGGTGCTGC CGAATGTTGC GATCACGCCA CAGCCGACGG AGGTGTACAC CTATTTCAAT GAGTATGACA ATACGTTCTT CATCAGTTTC GACACGCGCC CTTCGACGGC GTATACGATT GCGATCGGAC CGGATATCGC CGATCCTTAC GGGAGTCGCA CCGGTCAGTC CCTGACGGTG CGCTTCCGCA CTGCGCCGCT GGATCCATGG GTCTATCCGC TGACACCCGG TTTCATCACG ACGTTCGACG CTAACCGCGC GCCCCGGATT GCGCTGATGG CAACCAATGT CACCAGTGCG TCGCTGAGGC TCTACCGTCT GCCGATCGAG GCGCTGCTGC GCCGCGACAT CCTCGGACCC GATGGTGCAT CCCCTGCTTC CGGCGCGACG CTTGTGCGCA GCTGGCGCGA GCAGTTCAGC GTGCCGCGTG ATGAACCGAC GCCGGTGCGC GTCGATCTGG TCGAGGGGGG TGGTCGCCTC GATCCTGGAC TGTACCTGCT GCTGCTGGAT CTTCCCTCCG GCTATCCTGA GGCGCGAGTG CTGGCAGTGT CGCCGCTGCA CCTGACACTG AAGGCGGCGG AACGCAATGC GCTGGTGTGG GCGAATGACC TGACGACCGG CGCGCCGGTT TCCGGTCTGA CGCTGGAGTT GTTCGATGAG CAGGGCGGTT CGCTTGGAAC GGCGACCACC GACGCGAATG GGGTGGCGAC GGCGACGCTC AACCGCACCC AATACCGTGG AATGATCGCA GTGGCGCGGC AACCGTTCGC CATTTTCGGC GCAGACTGGA GCGCCGGCGT CAGCCCATGG GATTTCAGCC TCCCGGCATC ATTCGATCTG CCAGAAGTCA TCGCCTATGT CTACACCGAC CGCCCGATCT ATCGTCCCGG TCAGCGGGTG TGGTTCAAGG GCGCAGTGCG CGCCGATAAC GATGTGCGCT ACACCCTCAT GCAGGGGTTG AACGCGGCGC AGGTCACGGT CTACGATGCC GCTGGCGAAG CGGTCTTCCA GCAGGCGGTG AATCTGAACC AGAACGGCGC CTTCGACGGC GGGTTCACGC TGGCGAATGG CGCGCCGACT GGTGAGTACG CTATCAGTCT GAATATCGGC GGTCAGGTGT TCCGTTTCCC CTTCCAGGTC GCCGCTTACC GTCCGCCGGA GATCGAAGTC ACGGTGACGC CGCGCGCTGC TGAGGTTGTG CGCGGCACGC CAACCGATGC GACGGTGCGC GCCGGGTATT TCTTCGGCGC GCCAGCCGCG AACCTGCCGG TGCAGTGGAA TGCGCTGGCG GAACCGTTCG CGCCTGCGCC CGATTGGGCA GGCAGGTACA CCTTCGGCGA ATATGGCGAT CTGTGGATTT GCCGTTTCTG CTGGTGGGTT CCATCGCCTC CGCCGCAACC GATCCTCTCC GGCAGCGCGA CGACCGATGC GCAGGGGGAG GCGATCATCA GTCTGCCGGG TGAACTGCGC GACTCGGAAG GGAATGTCAT CACGCGCAGC GTGCGCCTCA CCGTCGAGGC GACCGTCACC GGGCGCGACA ATCAGGCGAT CAGCGGGCGC AGCACCGTCA TCGTGCATGC CAGCGACCTG TACGTCGGTC TGGCGCCGCG CGCGTATGTC GGCAGGGCAG GCGTTGCGCA GCAGATCGAT CTGGTGACGA TCGACACACG CGGCAATCGT CTGGCAAACC GCACGGTCGA AATCGAACTG GTGCGCACCA CGTGGGAGAA CCGCTTCGTG CAGGATGACG CTGGCGGACG GTGGGAATCG CGCGAGGTGC GCGAGCCTGC CAGCACGCAG ACGGTCACGA CCGACGGGAA TGGCGATGCA GTCATTTCGT TTACCCCCGA CAAAGGCGGC GCTTATCTGG TGCTGGCGCG CACACGCGAC GCTGGCGGGC GCGAGGCGCG TTCATCGCTC TACGTCTGGG TGTACGGCGG CGATGCGCTC TGGCTGCGCG AGAATAACGA CCGCATCAAC CTGATCGCCG ACAAAGGCGA GTATCGCCCC GGCGAGACGG CGACCATCCT CATCCCCTCG CCGTTCACCG GACCGCACTG GGCGCTGCTG ACCGTCGAGC GCGGCGGTGT GCTGAGCCAC GAGGTGCGCC GGGTCAGCGG CGGCAGCCTG GTCTATCAAC TGCCGATCAC CCCAGAGCAC GCGCCGAACA TTTTCGTCTC GGCGGTGCTG TTCGCGCCGC CCGACAGCAA TGGCGCACCG GCGGATTACA AGGTCGGCAT CCTGCCGCTG AGCGTGACAC CCGTCCTGCA AACCTTGCAG GTCGCCGTGA CAACCACAAC GCCACAGGCA GCGCCTGGCG ACACGGTGCA GTTTGAGGTG CGCGTCACTG ATGTGACAGG CGCGCCGGTG GCGGCGGAAC TGTCGCTCGA CCTGGTCGAC AAAGCGGTGC TGTCGCTGCA ACCGCGCGAG CCAGACGCGA TCGTGCAGGC ATTCTACGGT CGTCGTCCGC TGGGGGTGTT TACCGGCGCC GGGCTGTCGG TCGCCGCCGA GCGATTCGAG CGGTTGCTGG ACGAGGCGCA ACGCAATGCG CCTCCGGGCG CGGGCGCGGC TGGACCGGAG GCTGCCGTAC CGATGGTCGG GGCAGCGCCG ACAGAAGCGC CGGCAGCGGC AATGCCTGCG CGCGCAGGAG ACGCCGCACT TCAACAGGGA TTGACCATCC GCCAGGAGTT TGCCGATACC GCATTCTGGC AGGCGATTGT GGCGACCGAT GCCAGCGGGC GCGCCAGCGT GCAGGTGACG CTGCCCGACA ACCTGACGAC GTGGGTGATG CGCGGCGTGG CGCTCACCGC CGATACGCGC GTCGGCGAAG GGACCGGCGA ACTGATTAGC ACCAAACCGC TGCTGATCCG TCCGGTCACG CCGCGTTTCT TCGTCGTCGG TGATGTGGTG GAGCTGACGG CGAATGTCAG TAATCGCACG AATGCGCCAC TGGCGACAGA GGTGACACTG GGCGCCGATG GGGTGACGGT CAACGCGCCA ATCACGCAGA CGATCCAGGT TCCGGCGAAC GGCGAGGCGT CGGCGACCTG GCAGGTGACC GTCCTCGATG TCGAGGCGGT CGATCTGGTG TTCAGCGTAG TATCCGGTCA GCTGAGCGAT GCGGCGCGAC CGCGGATCGC CAGCGCGCCA GGTGGACGCA TTCCGGTCTA TCGCTACAGC GCACCGCAGA CGGTTGCGAC CGGTGGGCAG ATCGACACGG CAGGCGCGCG GGTCGAGGCG GTGGCATTGC CGCCGAATGT CGATGCGCGC CTGGGAGAAC TGCGCGTTCG GATCGATCCC TCGCTGGCAG CCGGCGTGCT CGACGGCTTG CGGGCGCTGG AGGAATACCC GTATGAGTCG GTCGATGCAA CCGTGTCGCG CTTCGTGCCG AGTGTGGCTG CGCTGCGGGC GCTGCGTCAG TTGGGCGTGG CGAATGCCGA ACTGGAGGCG CGTTTGCCAG CGCTGGTGAC CGATGCGCTC GACCGGCTCT CCCTGTGGCA GAACGCAGAC GGCGGATGGG GCTGGTGGGC GGAGGATGAG AGCAATCCGT ATATGAGCGC GTATGTAGTG TTCGGCATGC TGCGCGCGCG CGAAGCAGGT TTCACCGTGC GTGATGATAC ACTGGCGCGC GGGATGGAGT ATCTCGTCGC GCAACTCGCC GACGACGCCG ATGTGCGCAC TGTGCAGCAG GCGAACCGGC AGGCATGGCT GCTCTACGTG CTTGCCGATG GCGGCAGACC GGACGGCAGA CGGATGGATG CGCTCTACAA CAACCGCGAG CGCCTGGGCG TGTATGGCAA AGCGCTGCTG GCGCTGGCGA TCCACCGCGT CAATGCCAGC GATGCCCGCC TGAAAACCCT GCTTTCCGAT CTGAACAATT CTGCAATTGT GAGCGCGACC GGGGTCCACT GGGAAGAAGC CGGGCGAGAC TATTGGGCGT TCAGCAGTGA CATGTGCAGT AGCGCGATTG CGCTCCAGGC GCTCGTGCGG CTCGACCCGC AGAACCAGAT CATCCCCAAC GCCGTGCGCT GGCTCATGGT CGCTCGCCGC GGCGACGCCT GGCTCTCAAC CCAGGAGTCG ACGTGGGCGC TGCTGGCGCT GACCGACTGG ATGGCGCTGA CCGGCGAACT CAGCGGCGAC TACGACTATG CCGTCTGGCT GAACGGCAAC GAACGCATCG CCGGTCGCAT CGACGCCACC AACGTCATGT CGCCGACGGT TGTGCGCATC CCGACGACCG ACCTGTTGAC CGGCGATCCA TCGCTGGTGG CAGTGGGGCG CAGCGAAGGA CCGGGGCGGC TGTACTACGC CGCGCACCTG AACCTTGCGT TGCCAGCCGA TCAGGTGCAG GCGCTCGACC GGGGCATTGC CGTGACACGC CGCTATGTGG CGGCAGACTG CACCGACGGA CCGCGTTGCC CCACGCTGAC CAGCGTCAAA GCTGGCGACA CCGTGCGGGT CGAACTCTCC ATCGTTGCAG AGCGCGATCT CTCCTACGTG CAGGTGACAG ACCCACTCCC CGCCGGTGGC GAAGCCATCG ACCCGACCCT GGCGACAACC GCGATCGCCA CACATGTCGG ACCGACACTC CAACCGGCGC CGGACGCTAC GACGCCGTAC TTCTGGTGGT GGTGGCGCTG GTACGACCGG ATTGAACTGC GCGACGAAAA GGTTGCACTG TTCGCCGATT ACCTGCCGCG TGGCGCCTAC CTCTTCAGTT ACACCTTCCG CGCCGTGCAA CCCGGCGAAT ATCGTGTTAT TCCGACTATC GCGCAGGAGA GTTTCTTCCC CGAAGTGTTC GGAAGGTCGG ATGGGCAACT GTTCACCATC ACGCGGTAA
|
Protein sequence | MPPLYRRFAI LLILAVTLAA CGGPRPQPTP TPVTAQPTPV APQRDIPPPP DRAAPILVAR SPEPGQALDP GAPIELVFDR PMDRASVAAA LNIAGVTGAI EWRDARTVRF VPSAPLQRAS TYEVFLRETA KGADGLPLAA PVRFRFATAG FLEVGQVIPA DGAADVQPNS TITVFFNRPV VPLTAIESQA NLPQPVTFDP PIAGRGEWLN TAIYTFVPAA PLAGGATYTG RIAAGLTDVT GNPLQSEYTW RFTVARPQVV MIDPSDGATL VPLQPRITLR FNVPVDPASA RAAFRLSQPD GAPIPGDLQI DGETLVFTPS QRLDFETRYT VEVAAGLTGV SGGLGMANDF RATLQTVPRL RILETDPRDG ETDARRGGLT IRFNAPVDPA TVLPNVAITP QPTEVYTYFN EYDNTFFISF DTRPSTAYTI AIGPDIADPY GSRTGQSLTV RFRTAPLDPW VYPLTPGFIT TFDANRAPRI ALMATNVTSA SLRLYRLPIE ALLRRDILGP DGASPASGAT LVRSWREQFS VPRDEPTPVR VDLVEGGGRL DPGLYLLLLD LPSGYPEARV LAVSPLHLTL KAAERNALVW ANDLTTGAPV SGLTLELFDE QGGSLGTATT DANGVATATL NRTQYRGMIA VARQPFAIFG ADWSAGVSPW DFSLPASFDL PEVIAYVYTD RPIYRPGQRV WFKGAVRADN DVRYTLMQGL NAAQVTVYDA AGEAVFQQAV NLNQNGAFDG GFTLANGAPT GEYAISLNIG GQVFRFPFQV AAYRPPEIEV TVTPRAAEVV RGTPTDATVR AGYFFGAPAA NLPVQWNALA EPFAPAPDWA GRYTFGEYGD LWICRFCWWV PSPPPQPILS GSATTDAQGE AIISLPGELR DSEGNVITRS VRLTVEATVT GRDNQAISGR STVIVHASDL YVGLAPRAYV GRAGVAQQID LVTIDTRGNR LANRTVEIEL VRTTWENRFV QDDAGGRWES REVREPASTQ TVTTDGNGDA VISFTPDKGG AYLVLARTRD AGGREARSSL YVWVYGGDAL WLRENNDRIN LIADKGEYRP GETATILIPS PFTGPHWALL TVERGGVLSH EVRRVSGGSL VYQLPITPEH APNIFVSAVL FAPPDSNGAP ADYKVGILPL SVTPVLQTLQ VAVTTTTPQA APGDTVQFEV RVTDVTGAPV AAELSLDLVD KAVLSLQPRE PDAIVQAFYG RRPLGVFTGA GLSVAAERFE RLLDEAQRNA PPGAGAAGPE AAVPMVGAAP TEAPAAAMPA RAGDAALQQG LTIRQEFADT AFWQAIVATD ASGRASVQVT LPDNLTTWVM RGVALTADTR VGEGTGELIS TKPLLIRPVT PRFFVVGDVV ELTANVSNRT NAPLATEVTL GADGVTVNAP ITQTIQVPAN GEASATWQVT VLDVEAVDLV FSVVSGQLSD AARPRIASAP GGRIPVYRYS APQTVATGGQ IDTAGARVEA VALPPNVDAR LGELRVRIDP SLAAGVLDGL RALEEYPYES VDATVSRFVP SVAALRALRQ LGVANAELEA RLPALVTDAL DRLSLWQNAD GGWGWWAEDE SNPYMSAYVV FGMLRAREAG FTVRDDTLAR GMEYLVAQLA DDADVRTVQQ ANRQAWLLYV LADGGRPDGR RMDALYNNRE RLGVYGKALL ALAIHRVNAS DARLKTLLSD LNNSAIVSAT GVHWEEAGRD YWAFSSDMCS SAIALQALVR LDPQNQIIPN AVRWLMVARR GDAWLSTQES TWALLALTDW MALTGELSGD YDYAVWLNGN ERIAGRIDAT NVMSPTVVRI PTTDLLTGDP SLVAVGRSEG PGRLYYAAHL NLALPADQVQ ALDRGIAVTR RYVAADCTDG PRCPTLTSVK AGDTVRVELS IVAERDLSYV QVTDPLPAGG EAIDPTLATT AIATHVGPTL QPAPDATTPY FWWWWRWYDR IELRDEKVAL FADYLPRGAY LFSYTFRAVQ PGEYRVIPTI AQESFFPEVF GRSDGQLFTI TR
|
| |