Gene Rmar_1757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1757 
Symbol 
ID8568409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp2047521 
End bp2049530 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content55% 
IMG OID 
ProductFibronectin type III domain protein 
Protein accessionYP_003291029 
Protein GI268317310 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000152672 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGGAG CATACTGGGG ACACTGGCGG GGCCTGTGGA TCAGTGCAAA AAACTGGCAG 
TCTCCAGATG GCCAGACCTT CCCGGTGCGC ATTGAGCATA TTGGCCCTCG GTTTAATGGG
TTAGGGGAAT ATTTTCAGGA TGTGATCAAG CTGGTATATA AGTTTGATGC GCCAGAGATT
ATCGTCGACG GGCTAACCTC TTTTGACAAG CCTGCTGTTC CGGATGAAAT TAACCCGGAT
ATTCCTGCCG ATGCGATGGT GGAGAACATA GCCCATACGG CAATGGGAAT AGAAGTGCGT
CGGCGGGTTT ATCAGTTTAG CAATGAGCAG CATCAGGATT ATCACATTAT TGAGTATGTG
TTCACGAATA CAGGCAATGT GGATGAAGAT GAGGACATAG AACTGCCTAA TCAGACGCTG
GAGGACGTTT ATTTCACCTT CTTCTACCGC AACAAAGCCA ATGCGCCGGC TGGGGCCTGG
GATAACTCAG CGGGAGGTGC TGCGTGGGGG AAGTACACCA TGAATGACGC CCTGGTGCCG
GACTGGGCTG ATATGCCAGG TGAGCAGTTT AGCGAGCAGT TTGCGCAAGG CTACGACTTC
TGGCAAGATC ATGCCGCGCA GTTCTCCTGG CTGGGGCATG TGCCGGATCA GACCAACTTT
AATACCATTG GTAATCCCAT GTGGTTTGAG CTCCAGCCCT GGATCGCGCA CATCGGCGGC
GATACGACAG GACGCCTAGG AGCAGCCGCT ATGTTCGGCA CGCTGACGGT TCATGCCGAC
CGCTCGGCAA GTGACGAATC GCACGATAGA GCGCAGCCGA GCATGATGGA CATCCTGGAC
TCAGACGATG CCGACCTGAC CAGCCGGAAT GATCACAATG ATATTAATCA GATGCAGTTC
GAGCGGGACT GGCTAGAAGA TGGGTTCAAG AATGGGACAG GGTCGGCTCG CTATAGCGAT
GAGAAGCCGC CGCATGCCTG GCGTATTCAG CCAGATGGTG ATTTTGCGCG GCAGACAGCC
CCACCGCAAC CCAGTGAAGG AGGGTATGGA TACGTGCAGA GCTTTGGACC CTATACGCTA
GGGCCAGGTG AGGATGTGCG GATTGTGGTT GCTGAAGCGA TTGCCGGGCT GAATGATAAG
CTGGCCTATG CCCTGGGGCG CTGGTATAAG CAGCAGGTTC GTCTGCAGGG ACAAGAAGTG
GCGAATAACC TGCTATTCTA CTGGAATCCC ACAACCAACA CCTCCTGCAA TCAGGGCGAT
CCGGGCTGTA TTGGCAGGAC CAAGAACGAC TGGGTCATGA CCGCCCGCGA CTCGCTCTTC
AAACGCTTTG ACCAGATCCT CGAGGTCTGG AATAATGGGA TGCAGGTCCC GCAGGCTCCG
AAGCCGCCGC GGCGATTTGT GGTAAGCTCC GGCACGGATC AGATTACCCT GGAGTGGGAA
ACCTATGCAG GTGAGCCGGA TCCGGCAGGG TGGGAAATCT GGCGGGCGCA GAACTATTAC
TTTGGCATCC CCCTGCCAGA TAGCTCCACT GTCTATAAGA AGATTGCCGA ACTGCCAGGG
AATGCCCGTT CCTATATTGA TACCGAAGTG ACCCGTGGAG TCAACTACTT CTACTACATT
CAGGCGGTCG GCAGCAATGG CTTGAAGAGC AATCGGTACT GGACACAGAC CTATCTGCCG
GCCGTACTCC GGCGGCCGCC CGGAGCTTCG TTGGATGACG TACGGGTTGT CCCCAACCCC
TATGTGTTGG AGGCCGATCT GGGCGTGCGC TTCCCGGATG TTCAGGATAA AATTGCCTTC
TACGGGCTGC CACCACAGGC CACGATTCGG ATTTACACAG AGCTGGGTGA ACTTGTGACG
GTGATTGAGC ACACGGACGG AAGTGGTGAC GAATTCTGGA ATCTGACCAC TTCGTCCCGT
CAGGTGGTGG CCAGTGGGAT CTACTACGCT GTGATCACGG ACAAGGAAAC GGGCAAGCAG
ACTACTCGGA CGATCGTGAT CATTCGCTGA
 
Protein sequence
MIGAYWGHWR GLWISAKNWQ SPDGQTFPVR IEHIGPRFNG LGEYFQDVIK LVYKFDAPEI 
IVDGLTSFDK PAVPDEINPD IPADAMVENI AHTAMGIEVR RRVYQFSNEQ HQDYHIIEYV
FTNTGNVDED EDIELPNQTL EDVYFTFFYR NKANAPAGAW DNSAGGAAWG KYTMNDALVP
DWADMPGEQF SEQFAQGYDF WQDHAAQFSW LGHVPDQTNF NTIGNPMWFE LQPWIAHIGG
DTTGRLGAAA MFGTLTVHAD RSASDESHDR AQPSMMDILD SDDADLTSRN DHNDINQMQF
ERDWLEDGFK NGTGSARYSD EKPPHAWRIQ PDGDFARQTA PPQPSEGGYG YVQSFGPYTL
GPGEDVRIVV AEAIAGLNDK LAYALGRWYK QQVRLQGQEV ANNLLFYWNP TTNTSCNQGD
PGCIGRTKND WVMTARDSLF KRFDQILEVW NNGMQVPQAP KPPRRFVVSS GTDQITLEWE
TYAGEPDPAG WEIWRAQNYY FGIPLPDSST VYKKIAELPG NARSYIDTEV TRGVNYFYYI
QAVGSNGLKS NRYWTQTYLP AVLRRPPGAS LDDVRVVPNP YVLEADLGVR FPDVQDKIAF
YGLPPQATIR IYTELGELVT VIEHTDGSGD EFWNLTTSSR QVVASGIYYA VITDKETGKQ
TTRTIVIIR