Gene Information |
Locus tag | Rmar_1757 |
Symbol | |
ID | 8568409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | - |
Start bp | 2047521 |
End bp | 2049530 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | Fibronectin type III domain protein |
Protein accession | YP_003291029 |
Protein GI | 268317310 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000152672 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGGAG CATACTGGGG ACACTGGCGG GGCCTGTGGA TCAGTGCAAA AAACTGGCAG TCTCCAGATG GCCAGACCTT CCCGGTGCGC ATTGAGCATA TTGGCCCTCG GTTTAATGGG TTAGGGGAAT ATTTTCAGGA TGTGATCAAG CTGGTATATA AGTTTGATGC GCCAGAGATT ATCGTCGACG GGCTAACCTC TTTTGACAAG CCTGCTGTTC CGGATGAAAT TAACCCGGAT ATTCCTGCCG ATGCGATGGT GGAGAACATA GCCCATACGG CAATGGGAAT AGAAGTGCGT CGGCGGGTTT ATCAGTTTAG CAATGAGCAG CATCAGGATT ATCACATTAT TGAGTATGTG TTCACGAATA CAGGCAATGT GGATGAAGAT GAGGACATAG AACTGCCTAA TCAGACGCTG GAGGACGTTT ATTTCACCTT CTTCTACCGC AACAAAGCCA ATGCGCCGGC TGGGGCCTGG GATAACTCAG CGGGAGGTGC TGCGTGGGGG AAGTACACCA TGAATGACGC CCTGGTGCCG GACTGGGCTG ATATGCCAGG TGAGCAGTTT AGCGAGCAGT TTGCGCAAGG CTACGACTTC TGGCAAGATC ATGCCGCGCA GTTCTCCTGG CTGGGGCATG TGCCGGATCA GACCAACTTT AATACCATTG GTAATCCCAT GTGGTTTGAG CTCCAGCCCT GGATCGCGCA CATCGGCGGC GATACGACAG GACGCCTAGG AGCAGCCGCT ATGTTCGGCA CGCTGACGGT TCATGCCGAC CGCTCGGCAA GTGACGAATC GCACGATAGA GCGCAGCCGA GCATGATGGA CATCCTGGAC TCAGACGATG CCGACCTGAC CAGCCGGAAT GATCACAATG ATATTAATCA GATGCAGTTC GAGCGGGACT GGCTAGAAGA TGGGTTCAAG AATGGGACAG GGTCGGCTCG CTATAGCGAT GAGAAGCCGC CGCATGCCTG GCGTATTCAG CCAGATGGTG ATTTTGCGCG GCAGACAGCC CCACCGCAAC CCAGTGAAGG AGGGTATGGA TACGTGCAGA GCTTTGGACC CTATACGCTA GGGCCAGGTG AGGATGTGCG GATTGTGGTT GCTGAAGCGA TTGCCGGGCT GAATGATAAG CTGGCCTATG CCCTGGGGCG CTGGTATAAG CAGCAGGTTC GTCTGCAGGG ACAAGAAGTG GCGAATAACC TGCTATTCTA CTGGAATCCC ACAACCAACA CCTCCTGCAA TCAGGGCGAT CCGGGCTGTA TTGGCAGGAC CAAGAACGAC TGGGTCATGA CCGCCCGCGA CTCGCTCTTC AAACGCTTTG ACCAGATCCT CGAGGTCTGG AATAATGGGA TGCAGGTCCC GCAGGCTCCG AAGCCGCCGC GGCGATTTGT GGTAAGCTCC GGCACGGATC AGATTACCCT GGAGTGGGAA ACCTATGCAG GTGAGCCGGA TCCGGCAGGG TGGGAAATCT GGCGGGCGCA GAACTATTAC TTTGGCATCC CCCTGCCAGA TAGCTCCACT GTCTATAAGA AGATTGCCGA ACTGCCAGGG AATGCCCGTT CCTATATTGA TACCGAAGTG ACCCGTGGAG TCAACTACTT CTACTACATT CAGGCGGTCG GCAGCAATGG CTTGAAGAGC AATCGGTACT GGACACAGAC CTATCTGCCG GCCGTACTCC GGCGGCCGCC CGGAGCTTCG TTGGATGACG TACGGGTTGT CCCCAACCCC TATGTGTTGG AGGCCGATCT GGGCGTGCGC TTCCCGGATG TTCAGGATAA AATTGCCTTC 
TACGGGCTGC CACCACAGGC CACGATTCGG ATTTACACAG AGCTGGGTGA ACTTGTGACG GTGATTGAGC ACACGGACGG AAGTGGTGAC GAATTCTGGA ATCTGACCAC TTCGTCCCGT CAGGTGGTGG CCAGTGGGAT CTACTACGCT GTGATCACGG ACAAGGAAAC GGGCAAGCAG ACTACTCGGA CGATCGTGAT CATTCGCTGA
|
Protein sequence | MIGAYWGHWR GLWISAKNWQ SPDGQTFPVR IEHIGPRFNG LGEYFQDVIK LVYKFDAPEI IVDGLTSFDK PAVPDEINPD IPADAMVENI AHTAMGIEVR RRVYQFSNEQ HQDYHIIEYV FTNTGNVDED EDIELPNQTL EDVYFTFFYR NKANAPAGAW DNSAGGAAWG KYTMNDALVP DWADMPGEQF SEQFAQGYDF WQDHAAQFSW LGHVPDQTNF NTIGNPMWFE LQPWIAHIGG DTTGRLGAAA MFGTLTVHAD RSASDESHDR AQPSMMDILD SDDADLTSRN DHNDINQMQF ERDWLEDGFK NGTGSARYSD EKPPHAWRIQ PDGDFARQTA PPQPSEGGYG YVQSFGPYTL GPGEDVRIVV AEAIAGLNDK LAYALGRWYK QQVRLQGQEV ANNLLFYWNP TTNTSCNQGD PGCIGRTKND WVMTARDSLF KRFDQILEVW NNGMQVPQAP KPPRRFVVSS GTDQITLEWE TYAGEPDPAG WEIWRAQNYY FGIPLPDSST VYKKIAELPG NARSYIDTEV TRGVNYFYYI QAVGSNGLKS NRYWTQTYLP AVLRRPPGAS LDDVRVVPNP YVLEADLGVR FPDVQDKIAF YGLPPQATIR IYTELGELVT VIEHTDGSGD EFWNLTTSSR QVVASGIYYA VITDKETGKQ TTRTIVIIR
|
| |
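The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon of the CDS is a stop codon, as the TGA at the end of the gene sequence suggests. The function names and the `record` dictionary are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def ungrouped(seq_field: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(seq_field.split())


# Values copied from the record above (hypothetical container for the demo).
record = {
    "start_bp": 2047521,
    "end_bp": 2049530,
    "gene_length_bp": 2010,
    "protein_length_aa": 669,
}

length = gene_length(record["start_bp"], record["end_bp"])
protein = expected_protein_length(length)

print("gene length matches:", length == record["gene_length_bp"])
print("protein length matches:", protein == record["protein_length_aa"])
```

Applying `ungrouped` to the Gene sequence and Protein sequence fields and taking `len` of the result would allow the same comparison against the sequences themselves.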