Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1300 |
Symbol | |
ID | 5538772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1673372 |
End bp | 1679380 |
Gene Length | 6009 bp |
Protein Length | 2002 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640893438 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001431415 |
Protein GI | 156741286 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.64211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.760951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCGC TCTATCGCCG GTTTGTATTG CTGATGGTAT GCGTGACGAT CCTTGCCGCG TGCGGTAGTC CGCGCCCGCA GCCAACACCG ACCCCCGTGA CCGCTCAACC AACCCCGGTT GCGCCACGAC GCGATATTCC ACAACCGCCA ACCCAGGCGG CGCCGACCCT GGTGGCCCGT TCACCTGAAC CGGGGCAGGC GCTCGATCCC GGCGCACCGG TTGAATTGGT CTTCGACCGC CCCATGGACC GCGCATCGGT GGCGGCAGCC CTGACCGTCG CCGGAGTAAC CGGCGTAGTG GAATGGCCCA ATGCGCGCAC CGTTCGTTTC GTGCCGGGCG CACCGCTCAA ACGCGCTTCC ACATATGAAG TCATGCTGCG CGAAACGGCG AAAAGCGCCG ATGGCATACC CCTGGCGGCG CCGGTGCGCT TCCGCTTCGC AACCGCAGGC TTCCTCGAAG TCGGGCAGGT GATCCCTGCC GACGGCGCCG CCGACGTGCA GCCCAATGCG ACTATCACCG TCTTTTTCAA TCGTCCTGTC GTTCCACTGA CGGCAATCGA GATGCAGACG AACCTGCCGC AGCCGCTGAC GTTCGACCCG CCAATCGCAG GGCGCGGAGA GTGGCTGAAT ACGGCGATCT ACACCTTCAT CCCTTCAGCG CCGCTCGCCA GCGGCGCCAC GTACACCGGA CGCATCGCCG CCGGGCTGAC CGATGTAACC GGGAATCCGC TCCCGTCGGA GTATACCTGG CGCTTTACCG TCGCTCGTCC GCAGGTGGTG ACGATCAATC CTTTCGATGG CGCGACCCTC GTGCCGCTGC AACCATCCAT CACGCTGCGC TTCAATGTGC CGGTCGATCC CGCCTCTGCG CGCGCAGCGT TTCGTCTGCG CGGTTCTGAT GGCGCCGACA TCCCCGGCGA TCTCCAGGTC ACCGACGAGA CGCTGGTCTT CACACCGGCG CAACGGCTGG AATTTGACAC GCGCTACACG GTTGAGGCGG CTGCCAGTCT GACCGGTATT TCCGGCGGGC TTGGCATGGC GAATGATTTC AGCGCAACGT TCCAGACGGT TCCTCGCTTG CGCATTCTGG AAACCGATCC GCGTGATGGG GAGACGAATG CACGCCGCGG CGGGTTGACG ATCCGCTTCA ACGCCCCCGT CGATCCGGCG ACCGTATTGC CGAATGTCGC CATCACGCCG CAACCAACGG AGGTGTACAC CTATTTCTAC GACACGACGT TCAATCTCAG TTTCGATACA CGCCCTTCGA CGGAATATTC GGTTGCGATC GGACCGGACA TCGCCGATCC CTACGGGAAT CGCACCGGTC AGTCGCTAGC GGTGCGCTTC CGCACAACGC CGCTTGAACC GCAGGTCTAT CCATTGACCC CTGGTTTCAT CACGACATTC GACGCCAACC GCGCGCCGCG CATTGCGTTA ATGGCAACCA ATGTCAATAA TGCATCACTG GCGCTCTATC GTCTGCCCGT CGAGGCGTTG CTGCGCCGTG AGATTCTCGG ACCTGACGGC GTCTCCCCAC CGTCTGGCGC GACGCTCGTG CGCCGCTGGC AGGCGCAGTT CAGTGTGCCG CGCGATGAAC CAACACCGGT GCGCATTGAT CTGGTCGATG GGGGGGGGCG CCTCGATCCT GGCTTGTATC TGCTGTTGCT CGATCATCCT TCCGGCTATC CCGAAACGCG GGTGCTGGCG GTATCACCAT TGCATCTGAC ACTGAAGGCG GCGGAGCGCA CTGCGTTGGT GTGGGCGAAT GATCTGACGA CCGGTGCGCC GGTGTCTGGT CTGGCGCTGG AATTGTTCGA CGATCAGGGC GCATCACTGG GCACAGCGAC TACTGACGCA AATGGGGTGG CGACGACGAC TCTCAACCGC ACCGAGTACC GTGGGATGGT CGCAGTCGCG CGACAGCCGT TCGCAATCTT CGGCGCAGAT TGGGGAACCG GCGTCACTCC GTGGGATTTC AGCCTCCCGG CGTCGTTCGA CCTGCCAGAA GTGACTGCCT ATGTCTACAC CGACCGACCA ATCTACCGTC CCGGTCAGCG GGTGTGGTTC AAGGGGGTCG TGCGCGCCGA GGACGATGTG CGCTATACCC TCATGCCGGG GTTGAACACG GCACAGGTTG CCGTTTACGA TGCAGCCGGT GAATCGATCA TTCAGCAACC GGTGAGTCTG AATCCGAACG GCGCCTTCGA CGGCGGGTTC ATGCTGGCAG CAGGCGCGCC GACCGGTCAG TACGCCCTCA GTCTGAATGT TGGCGGTCGC GAGTTCCGTT TCCCCTTCCA GGTCGCTGCC TATCGTCCCC CAGAGATCGA GGTGACGGTG ACGCCGCGCG CTGCCGGGAT TATGCGCGGT GCACCGACAG AAGCAACAGT GCGCGCCGCT TATTTTTTTG GCGCGCCGGC CGCAAACCTG CCGGTGCAGT GGAATGTGCT GGCGGAACCA TTCGCTCCCG CGCCTGATTG GGCGGGCAGG TACACCTTCG ACGAGTCTGG CGATGTGTGG GTCTGTCGCT TCTGCTGGTG GATTCCCGCT CCGCCGCCGC AACCGATCCT CTCCGGCAGC GCCACAACCG ATGCGCAGGG ACAGGCGATC ATCAGCCTGC CGGGTGAATT GCGCGACCCG GAAGGCAACG TTATCACCCG CAGCGCGCGC CTGACGGTCG AGGCGACGGT CACCGGGCGC GATAATCAGG CAATCAGCGG GCGCACTGCT ATCGTCGTGC ATGCCAGCGA CCTGTACGTC GGGCTGGCGC CGCGCGCGTA TGTCGGCAGG GCAGGCGCGG CGCAGCAGAT CGACCTGGTG ACCATCGACA CGCGCGGCAA TCGCCTGGCA AGCCGCGCGG TCGAAATCGA ACTGGTGCGC ACCACCTGGG AAAATCGCTT CGTGCAGGAC GACGCTGGCG GACGCTGGGA GTCGCGTGAG GTGCGCGAAC CTGTCGGCAC GCAGACGGTC ACAACCGACG CGAATGGCGA GGCGGTCGTT TCGTTCACTC CCGACAAAGG TGGCGCCTAC CTGGTGCTGG CGCGCGCACG CGATGCTGGC GGGCGCGAGG CGCGCTCCTC GCTGTATGTC TGGGTCTACG GCGGCGATGC GCTCTGGCTG CGCGAGAATA ATGATCGCAT CAACCTGATA GCCGACAAAA GCGAGTATCG CCCCGGCGAA ACAGCAACGA TCCTCATCCC CTCGCCGTTC ACTGGAACGC ATTGGGCGCT GCTGACCGTC GAGCGCGGCG GCGTCTTGAG CCACGAGGTG CGCCAGGTCA GCGGCGGCAG TCTGGTCTAT CAACTGCCAA TCACGGCTGA TCACGCGCCG AATATCTTTG TTTCGGCAGT GCTGTTTGCG CCGCCGGACG GCAGCGGCGC GCCTGCCGAT TTCAAGGTCG GCGTCCTGCC ACTCACCGTT GTGCCGACTG CGCAGATGCT GCGGGTTGAA GTGACCACTG CGACACCGCA GGCAGCACCC GGCGACGCAG TAACCTTCGA TGTGCGCGTG ACCGACACAA ACGGCGCGCC GGTAGCGGCG GAACTGTCGC TCGACCTGGT CGATAAAGCC GTGCTGTCGC TCCAACCACG CGAGCCAGAT GCCATTGTCC AGGGGTTCTA CGGTCGCCGT CCGCTGGGGG TGTTCACCAG CGCCGGTCTT TCGGTTGCCG CCGAGCGCTT CGAGCGTTTG CTGGACGAAG CGCAGCGCAA TGTGCCGCCG GGCGCAGGCG CGGCTGGACC AGAGACCGCT ATCCCTATGG TCGGGGCAAC GCCAACGGCG CCAGCCGCTA TGCCTGCCGA GGCGATGCCC GCGCGCACCG GCGATGCCGC ACTTCAGCAG GGATTGACTA TTCGCCAGGA GTTTGCCGAC ACCGCGTTCT GGCAGGCGGT TGTGACCACC GACGCTGGCG GGCGCGCGAC GGTGCAGGTT TCGCTGCCTG ATAATCTGAC GACCTGGGTG ATGCGCGGCG TGGCGCTCAC AATGGATACG CGCGTCGGTG AAGGAACTGG TGAACTGGTC AGCACGAAGC CATTGCTTGT GCGCCCGGTC ACGCCACGCT TCTTTGTCGT TGGGGATGTG GTAGAACTGG CAGTAAACGT CAGCAATCTT ACAAATACGC CCATGATGAC AACGGTGACC CTGAGCGCCG ATGGGGTTAC GGTCACCTCG CCGATCACGC AGACGATCCA GGTTCCGGCG AATGGCGAAG CCTCGGCAGC CTGGCAGGTA ACGGTTCTTG ATGGAGAGTC AGTCGATCTG GTATTCAGCG CCGTCTCCGG GCAACTGAGC GACGCGGCGC GACCCCGAAT CGCAACCGCG CCAGACGGGC GCATTCCGGT CTACCGCTAC AGCGCGCCGG AGACCGTGGC GACCGGCGGA CAGATCGACC GGGCAGATGC GCGCGTCGAG GCGGTAGCGC TGCCGCCGAA TGTCGATGCG CGCCTGGGAG AATTGCGCAT CCGGCTCGAT CCTTCGCTGG CGGCCAGCGT ACTCGACGGT CTGACGGCTT TGGAAGAATA TCCGTATGTC ACCGTCGAGT CGACCGTCTC GCGCTTTGTG CCGAATGTCG TGGCGCTGCG GATGCTCCGC CAGCTCGGAG TGACAAACAC CGAACTGGAG GCGCGCCTGC CGACCCTGGT CGCTGATGCG CTCGACCGGC TCTCCCTGTG GCAGAATGCC GACGGCGGAT GGGGCTGGTG GGCAGACGAT GAGAGCAATC CATACATCAG CGCGTATGCG GTATTCGGCA TGCTGCGGGC GCGCGAAGCG GGCTTCACCG TGCGCGACGA CACGCTGGCG CGCGGAACGG AGTATCTCGC TGCGCAACTC GCCGCCGACG CCGATGTGCG CACAGCACAG CAAGCGAACC GGCAGGCATG GCTGCTCTAC GTGCTTGCCG ACGGCGGCAG ACCGGATCGT GGGCGGATGG ACGCGCTCTA CGGCAACCGT GAGCGTCTTG GCGTGTACGG CAAGGCGTTG CTGGCGCTGG CGCTCCACCG CGTCGATGCT GGCGATGCGC GGCTGAAGAC ACTCCTTTCG GACCTGAACA ATGCCGCAAT TGTCAGCGCC ACCGGCGTCC ATTGGGAGGA AGCCGCGCGA GACTCCTGGG CATTCAGCAG TGACACGTGC AGTAGCGCGA TTGCGCTCCA GGCGCTTGTG CGACTCGACC CGCAGAACCA GATTATTCCC AACGTCGTGC GCTGGCTTAT GGTTGCTCGC CGTGGCGACA TCTGGCTCAC GACCCAGGAA TCGGTGTGGG GACTGCTGGC GCTGACCGAC TGGATGGCAA CGACCGGCGA ACTCAACGGC GCCTATGACT ATGCTGTCTG GCTGAATGGT AATGAGCGCA TCGCCGGGCG CATCGACGCC ACCAACGTCA TGTCGGCGAC GGTGGTGCGT GTGCCGACGA CTGAACTGCT GATCGGCGAC CCATTGCTGG TGGCGGTGGG GCGCAGCGAA GGAGCAGGTC GGCTGTACTA CACAGCGCAT CTGAACCTGG CGCTGCCAGC CGATCAGGTG AAGGCGCTCG ACCGGGGCAT CGCCGTGACG CGACGGTACG TGGCGGCGGA TTGCACGGAC GGACCACGTT GCCCGACGTT GACCAGCGTC AAAGCGGGCG ACATGGTGCG AGTCGAACTC TCGATTGTAG CCGAGCGCGA TCTCTACTAC TTCCAGATCG AAGATCCGCT CCCCGCTGGC GGCGAAGCCA TCGACCCGAA CCTGGCGACG ACTGTGATCG CTTCAGATAG CGGACCAACG CTGCGACCGG CGCCGGATGC CGCGACGCCG TACTGGTGGT GGTGGCGCTG GTACGACCGG GTCGAGCTGC GCGACGAAAA GGTTGCGCTG TTCGCCGATT ACCTGCCGCG CGGCGCGTAT CTCTTCAGTT ACACCTTCCG CGCCGTGCAA CCGGGCGAAT ACCGCGTCAT TCCGACACTT GCGCAGGAGA GTTTCTTCCC CGAAGTCTTC GGACGAGCGG ATGGGCAACT GTTCGTCATC ACGCGGTAA
|
Protein sequence | MPPLYRRFVL LMVCVTILAA CGSPRPQPTP TPVTAQPTPV APRRDIPQPP TQAAPTLVAR SPEPGQALDP GAPVELVFDR PMDRASVAAA LTVAGVTGVV EWPNARTVRF VPGAPLKRAS TYEVMLRETA KSADGIPLAA PVRFRFATAG FLEVGQVIPA DGAADVQPNA TITVFFNRPV VPLTAIEMQT NLPQPLTFDP PIAGRGEWLN TAIYTFIPSA PLASGATYTG RIAAGLTDVT GNPLPSEYTW RFTVARPQVV TINPFDGATL VPLQPSITLR FNVPVDPASA RAAFRLRGSD GADIPGDLQV TDETLVFTPA QRLEFDTRYT VEAAASLTGI SGGLGMANDF SATFQTVPRL RILETDPRDG ETNARRGGLT IRFNAPVDPA TVLPNVAITP QPTEVYTYFY DTTFNLSFDT RPSTEYSVAI GPDIADPYGN RTGQSLAVRF RTTPLEPQVY PLTPGFITTF DANRAPRIAL MATNVNNASL ALYRLPVEAL LRREILGPDG VSPPSGATLV RRWQAQFSVP RDEPTPVRID LVDGGGRLDP GLYLLLLDHP SGYPETRVLA VSPLHLTLKA AERTALVWAN DLTTGAPVSG LALELFDDQG ASLGTATTDA NGVATTTLNR TEYRGMVAVA RQPFAIFGAD WGTGVTPWDF SLPASFDLPE VTAYVYTDRP IYRPGQRVWF KGVVRAEDDV RYTLMPGLNT AQVAVYDAAG ESIIQQPVSL NPNGAFDGGF MLAAGAPTGQ YALSLNVGGR EFRFPFQVAA YRPPEIEVTV TPRAAGIMRG APTEATVRAA YFFGAPAANL PVQWNVLAEP FAPAPDWAGR YTFDESGDVW VCRFCWWIPA PPPQPILSGS ATTDAQGQAI ISLPGELRDP EGNVITRSAR LTVEATVTGR DNQAISGRTA IVVHASDLYV GLAPRAYVGR AGAAQQIDLV TIDTRGNRLA SRAVEIELVR TTWENRFVQD DAGGRWESRE VREPVGTQTV TTDANGEAVV SFTPDKGGAY LVLARARDAG GREARSSLYV WVYGGDALWL RENNDRINLI ADKSEYRPGE TATILIPSPF TGTHWALLTV ERGGVLSHEV RQVSGGSLVY QLPITADHAP NIFVSAVLFA PPDGSGAPAD FKVGVLPLTV VPTAQMLRVE VTTATPQAAP GDAVTFDVRV TDTNGAPVAA ELSLDLVDKA VLSLQPREPD AIVQGFYGRR PLGVFTSAGL SVAAERFERL LDEAQRNVPP GAGAAGPETA IPMVGATPTA PAAMPAEAMP ARTGDAALQQ GLTIRQEFAD TAFWQAVVTT DAGGRATVQV SLPDNLTTWV MRGVALTMDT RVGEGTGELV STKPLLVRPV TPRFFVVGDV VELAVNVSNL TNTPMMTTVT LSADGVTVTS PITQTIQVPA NGEASAAWQV TVLDGESVDL VFSAVSGQLS DAARPRIATA PDGRIPVYRY SAPETVATGG QIDRADARVE AVALPPNVDA RLGELRIRLD PSLAASVLDG LTALEEYPYV TVESTVSRFV PNVVALRMLR QLGVTNTELE ARLPTLVADA LDRLSLWQNA DGGWGWWADD ESNPYISAYA VFGMLRAREA GFTVRDDTLA RGTEYLAAQL AADADVRTAQ QANRQAWLLY VLADGGRPDR GRMDALYGNR ERLGVYGKAL LALALHRVDA GDARLKTLLS DLNNAAIVSA TGVHWEEAAR DSWAFSSDTC SSAIALQALV RLDPQNQIIP NVVRWLMVAR RGDIWLTTQE SVWGLLALTD WMATTGELNG AYDYAVWLNG NERIAGRIDA TNVMSATVVR VPTTELLIGD PLLVAVGRSE GAGRLYYTAH LNLALPADQV KALDRGIAVT RRYVAADCTD GPRCPTLTSV KAGDMVRVEL SIVAERDLYY FQIEDPLPAG GEAIDPNLAT TVIASDSGPT LRPAPDAATP YWWWWRWYDR VELRDEKVAL FADYLPRGAY LFSYTFRAVQ PGEYRVIPTL AQESFFPEVF GRADGQLFVI TR
|
| |