Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3939 |
Symbol | |
ID | 5541445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5140937 |
End bp | 5146210 |
Gene Length | 5274 bp |
Protein Length | 1757 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640896047 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001433990 |
Protein GI | 156743861 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTGTGC TGCGACAGAT CGGTGTGATC TGGACCGGCG CTTTGCTGGT CATTGTGGTG GCGTCCCTGG CGCTGCCAGG AGCGCGTTTC CTCATCCTCC CCGCGCTCTC CGCCGTTCCT GCGGTTGTTT CAGTGTCGCC GCCTGATGGC GCGCGCAGTG TGTCGCCACG CGCGGTGCTC ACTATTCAGT TCAGTGCATC GATGAATCCG CCGAGCGTCG AGCATGCGTT GCGTATCGAT CCCGCGACGG ATGTCGTCTA TGGTTGGGAT TTGGATCGCA CAACACTGAC GATCACGCCA ACGACTGCGC TTCAGGCTGG CGTGCGTTAC CACATCACGG TTGCGGAAAC GGCGCTGAGT CGGTATTTTC GCCCGCTGGC GCAACCGTTC ACGTTCGCCT TCGAGACAGC GCCGCCTGCT GCCATCACTG CCCTGTTGCC CCGCGATGGC AGTGTTGACG TCGCACTCGA TACGCTGGTC GGTGTGCGGT TTAGCCGCCC GATTGTTTCG CTCGACGCCC TTGCCCGCTC TGCGACGCTG CCTGCTCTCC GGAGCGACCC GCCGCTGGCG GGAAGCGTCA CCTGGCTCGA CCCGGCGACG TTGCTGTTCC GTCCATCCGA GCCGCTGCGC CCTGGCGTGC GCTATACCTT CTCCCTCGAT CCAGACCTGA CCGATCAGAG TGGTGTTCCG CTTGGACGAG CCTATACCTG GTCGTTTACA ACGCTGGCGC CGATGGTGCG TGAGGTGTCG CCGCCGCCGA ATGCGCGCCT GGTGGGACCG TATGAACCGC TGCGCATGGT GTTTTCGCAA CCGGTCGATC TTCAGGCGCT CGAAGCCGCA CTTTCGATTA CGCCTGCGGC GCCTGGGACG CTTGAAGAAG CGCTTTTGCC CGATGGCACG CAGATTGTGA CATATACGCC CACCATTGCG TGGCAGGCAG GAACCGCTTA CACCGTTGCG CTTCCGGCGG CGCTGGCGGA TGGAACGGCG CTCCTGGCAC AACCGTATCG CTGGAGTTTC GTTACCGCGC CAAAACCGGC ATTGATCGGA CGGTTCCCCG GCGAGGGGCA ACTGCTTCCG CCTGGAAGCA ATGTGCGCCT GATCTTCAGC ACTCCGGTCG ATGCCGAGGC GCTGCGCGCA CAACTGCGTA TCGATCCGCC GGTCGATGCC CTGCGTGTGA CCACCAACGA TGGTGAAGCG CGCATCGATG CATCGCTCCA GGCGGCAACG CTGTATACCA TCACCATCCC GGCATCGCTC ACCGATCGTG CCGGCGTTGC GCTCGAACGT GACTATCAGG TGCGTTTTTT TACGGCGCCT GCTGTACCAT CGTTGACCCT GCCAGAGGCG ACCGGGCGTG TGATCCAGGC GCTTCCCGAT CAGACCATTG GCGTGTTGAC GCGACGGACG AATCTCTCGG AATTGCGGCT GGCGCTCTAT CGGCTCGATG AGGCCACGCT GCTGCGCGCC TTGAGTTTTA GCGACTCGGA ATGGACTGCC TTTGAGCCGT CACGCTACGG ACTGTCACTC TTGCGCTCAT GGTCCGAACC ACTCGCCGAT CCGCTCAATA CAACGGTCGA ATCGAACGTG ACGGTGACGC TCGACGATGG CTCGCCGCCG CCATCGGGCA TCTACTTCCT GCGCCTCCGC ACGCCCGAAC GCGCAGGAAC CGGCGTCATT CTCGTTGTCT CGCGCGCTGC GCTCTCGCTT CAGGTCATTG GGCAACGCGC AATTGTGTGG ACAACGGATA CGGTCAGCGC AACGGTGATC CCTGATGCGC CGCTTGCACT TTACCGTCAA GGTTCACCGG TCGCCGTCGG GCGCAGCGAT GACCGCGGGG TGTGGACCAT CGATTTGAGC AGTATGAATC CGCGCGATCT TGTGGCGATC AGCAGCGATC TGTCCACATT CGCTGCATTG GAAACGCCAC CACCGACCGC GCCTGCGCCG CGCCTGCGCA TTGTTCTGGC AACCGACCGC ACTGTCTATT CGCCAGGCGC ACAGGTATCG ATCCGCGGTT TTGCCCGTCA GGTTGACGGG CAATCGTTCG AACCGCCAAC GCCAGGATTG CCGCTCTCTC TTGAAGTGCG CGATCCATCC GGTCGCACAG TGCAGAAGCG GATCACCCTC GATGGAACAG GCGTCTTTGA GACAACGTTG CCGCTTTCAG ACAATGCGCA ACCCGGTATC TATCGTGTTT CGACACCGCA GGATACAGAC AGCACACTTG CATTTTATGT CGATGAATCG ATGCCGCTCC GTGTGACGGT CGCCAGGGAT CACAACAACG ATGTGCTGGT AACCGTGCGC ACCCCGGAAG GACTGCCGGT TGCCGGTGCC GATGTCGCCT GGACTATCGA CTGCGAACCG CTGTCCCCTC CAATGAATGG CGAGATCGTG TTCGGCGCAG CCGACCCTCC ACCGCCGTTG TCGGGCACAG GAGTAGCCGA TAGCAACGGC GTGCTGGTGA TTGCCACACC CTCTCTATCG CCTGCTTCGT TCTATCGCTA CCGCTTCCGC GCCCGGGTTG CCGAACCATA TGGACCTGCA ATCACTATCG AACGACTGCT CGAAACGCCG CCTGCGCCAC TCGTCGGGCT TCGCACGCTG TCGTCGATTG TCCGTGTGGG GACGCCAGCG AGTGTCGAAG TCATCACACT TATCGGCGAT CAGCCGCTCG CCGCACAGCG TGTGCAGATC GAGGCGACGC TGCTCAATGG CGAGGCATCC AATGACGCTG CGGCATCGCC TGTGGATCGA CCGATTCTGA GTCGTGTGGT CGAGACCGAC AGCGATGGAA GAGCGATATT GAGCGTACCG CTTCCGGCGC CTGGCGTCTA TCGGGTGCGC GCTTCACTCG ACAGCGTCGC TCAGGCGACT CCGCCAACCG ACCTTATTCT GCGCGCATAC CAGCCTGGCT TTACCGATTG GCGCGGAGGG CAGCAAGGAA CGATGCTGCT GACCGACCGC CGTCAGTATC GACCAGGCGA TACGGCGCTG CTCCTGCCGC TCACGGCGCT TCCTGAAGGA CCAGCATTGC TGACCGTCCA CAGCAGTTCG GGAGATGTCC GGGAAGAATT GCGCACCCTG CGCGCCGGTG AACCGATGAC ATTGACCCTG ACGCCTGATG ATGCGCCTGG TGTTTGGGTG ACATTGACGC CTGCACTACG CCTGCCCGTT CAACACCCCT TGCAGGTCGA TCTACCGGTT GTAGCGGTGG ACACGTCGCT GTCACTCACG CTGACGACTG ATGCACAAAC GTATACACCC GATACGAACG CCACACTCAC ACTTACCGTC ACCAATGCAA ACGACACTCC GACACCGGTC GATACGCTGG TACGGATTGT GACCGATGAC GCTGAGATGC AGCAAACGGT TGTATGGCGT GTAGAACGAA CGAACGACGC CGGTGTGCTG CGCATCAATG CGCGGTTGCC GCGTTCTCCC GGCATCGTTC CGGTGGATGT GTGGGTTGCC GGTGAACGCG GTATCGGCAG AGTCAGCACG CGATTGACCG TCGTTCAACC GGTCGCCGCA CAGATTATTG CGCCGCCGTT TGCGCGCGCT GGCGATCAGA TCGATACCCG CATTCGTCTC ACGACAACCG ACGGCATAAC GCGAGAAACC AGCGTCGCAC TGCGCCTCCC TGATGGAACA TCGAGCGTTC AGATCGTCAC AATTCCAGCG ACAGGCGTGA CGTCCCTCCC GTTCACGCTG CGCGCGCCTG ATGCTATTGC CCTGGAGGTG CAGGCGACGG TGACAACCGG CGCAGCATTC AGCGAGACAG TGCGCACGAC GCTTCCTGTA CGCTCGCCTG CCGCAACGAT CAGCAGCTCA GGCGGCGCTC TTGTGACCGA TCGCTTCGAG ACGCAGATCG CAAGACCCGG TGATGGTTCG GCGGAGTGGG GGTGGCTCGA TCTCGCCGTT GCACCGTCGC TGAAGGGGCT GTCGCTGGAA CAGTCGCGCG CGTTCATGGC ACTTCCTGAC CGACATTCGC TCGAAGATGC GGCTATCATC CTGATGGCAG CGTCGCTGAC AGAGGCGCGC CCGGATGTGC AGGCAGCAAC CAAACACCTG GTAGAACGGC AGACAAACGA TGGGGGATGG TCATGGCGCT TCGGTGGCGG GTCAAATCCA GCCGTCACTG CCATCGTGCT CGAGGCGCTC GCCGACGCAA AGGCAGCCGG TGTCCCGTTG CCCGACGGGT CGCTTGAACG CGCGACAACC CTGGCGCTTC GTCTTGTGCG TGATCCTGAT GTTCCGCTCG AAACGCGCTT CTGCCTGAGT TCTGCGCTGA CCCAACTCGG TGTTGTTGAG TCGTTGCTTC CGCGTGACTG GGACGAGGGT GAACTGGACG TTCACGGGAT GGCATGCCAT CTGTTCGTGT TGCCGCCGGA TCAGGCGCGC GTCAGCCCGA CGCTGCCCCG GCTCATCAGT CTGGCGCAGC GCGCAGATGG AAAGGCGTGG TGGAGCGCAC CGCCCGATAG TGCGTTCCCT TATGACGACG TAGCCACAAC GGCGCTGGCG ATGCGTGCGA TCCATCACGC ATCCCCGCGC CATCCGCTGG CAACCGACGC CACGCGTTGG CTGATCGCTC GGATGACGCC AACCGGATGG GGCGACGCGC TGACAACTGC GCGCGTTGTG CAGGCGCTGC GCGTCATTAT GCCAGCAGAT GCGTCGGCAT CCGTTACCCT CTCCCTGAAT GGCGCACCCA TCACCCTGCC CGATACGCCG GATGCCACGC TGCGCCTGGT TCCCATCCCC ATTGCCGATC TGCGCCCAAC CAACACGCTG GTTGTCACCA GTAGCGGCGC GCCGGCGCTG GTCGCGTGGC AGACGACACA CGCGGTCAGC GCTCCCCTCT CATTCGAAGG CGTCGGTCTG CTCCGCGAAT ATCTGGACCC ACGCACTGGT GCGCCGATCA ATCCGGTGGG GTTGAAACAG GGACAGTTGG TGCAGGTGCG GCTGACACTC GCAGCATTCC ATGAACGACG CTTTGTGTCG GTGCGTGATG CGCTTCCCGC AGGATTTGCG CTGGTCGAGA CCGACGCCGG TTCGATTTTC CAGATTGACG CTTTTGACGA CCGGATTGAA ATTGCCGCTG AAACATTGCC GCCTGGCATT CACCAATACA CCTATCTGGC GCGTGCGTCT GTTGCGGGAG CATACGCTGC ACCGCCACCC AAATTGATCC TGCCTGGAGG TCGCGCATTC ACCGGCGTGG CGACGGCAAA CATGGTGCGG ATCGATGCGG CCGCACGAGC ATGA
|
Protein sequence | MRVLRQIGVI WTGALLVIVV ASLALPGARF LILPALSAVP AVVSVSPPDG ARSVSPRAVL TIQFSASMNP PSVEHALRID PATDVVYGWD LDRTTLTITP TTALQAGVRY HITVAETALS RYFRPLAQPF TFAFETAPPA AITALLPRDG SVDVALDTLV GVRFSRPIVS LDALARSATL PALRSDPPLA GSVTWLDPAT LLFRPSEPLR PGVRYTFSLD PDLTDQSGVP LGRAYTWSFT TLAPMVREVS PPPNARLVGP YEPLRMVFSQ PVDLQALEAA LSITPAAPGT LEEALLPDGT QIVTYTPTIA WQAGTAYTVA LPAALADGTA LLAQPYRWSF VTAPKPALIG RFPGEGQLLP PGSNVRLIFS TPVDAEALRA QLRIDPPVDA LRVTTNDGEA RIDASLQAAT LYTITIPASL TDRAGVALER DYQVRFFTAP AVPSLTLPEA TGRVIQALPD QTIGVLTRRT NLSELRLALY RLDEATLLRA LSFSDSEWTA FEPSRYGLSL LRSWSEPLAD PLNTTVESNV TVTLDDGSPP PSGIYFLRLR TPERAGTGVI LVVSRAALSL QVIGQRAIVW TTDTVSATVI PDAPLALYRQ GSPVAVGRSD DRGVWTIDLS SMNPRDLVAI SSDLSTFAAL ETPPPTAPAP RLRIVLATDR TVYSPGAQVS IRGFARQVDG QSFEPPTPGL PLSLEVRDPS GRTVQKRITL DGTGVFETTL PLSDNAQPGI YRVSTPQDTD STLAFYVDES MPLRVTVARD HNNDVLVTVR TPEGLPVAGA DVAWTIDCEP LSPPMNGEIV FGAADPPPPL SGTGVADSNG VLVIATPSLS PASFYRYRFR ARVAEPYGPA ITIERLLETP PAPLVGLRTL SSIVRVGTPA SVEVITLIGD QPLAAQRVQI EATLLNGEAS NDAAASPVDR PILSRVVETD SDGRAILSVP LPAPGVYRVR ASLDSVAQAT PPTDLILRAY QPGFTDWRGG QQGTMLLTDR RQYRPGDTAL LLPLTALPEG PALLTVHSSS GDVREELRTL RAGEPMTLTL TPDDAPGVWV TLTPALRLPV QHPLQVDLPV VAVDTSLSLT LTTDAQTYTP DTNATLTLTV TNANDTPTPV DTLVRIVTDD AEMQQTVVWR VERTNDAGVL RINARLPRSP GIVPVDVWVA GERGIGRVST RLTVVQPVAA QIIAPPFARA GDQIDTRIRL TTTDGITRET SVALRLPDGT SSVQIVTIPA TGVTSLPFTL RAPDAIALEV QATVTTGAAF SETVRTTLPV RSPAATISSS GGALVTDRFE TQIARPGDGS AEWGWLDLAV APSLKGLSLE QSRAFMALPD RHSLEDAAII LMAASLTEAR PDVQAATKHL VERQTNDGGW SWRFGGGSNP AVTAIVLEAL ADAKAAGVPL PDGSLERATT LALRLVRDPD VPLETRFCLS SALTQLGVVE SLLPRDWDEG ELDVHGMACH LFVLPPDQAR VSPTLPRLIS LAQRADGKAW WSAPPDSAFP YDDVATTALA MRAIHHASPR HPLATDATRW LIARMTPTGW GDALTTARVV QALRVIMPAD ASASVTLSLN GAPITLPDTP DATLRLVPIP IADLRPTNTL VVTSSGAPAL VAWQTTHAVS APLSFEGVGL LREYLDPRTG APINPVGLKQ GQLVQVRLTL AAFHERRFVS VRDALPAGFA LVETDAGSIF QIDAFDDRIE IAAETLPPGI HQYTYLARAS VAGAYAAPPP KLILPGGRAF TGVATANMVR IDAAARA
|
| |