Gene RoseRS_1520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1520 
Symbol 
ID5208475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1852866 
End bp1856063 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content61% 
IMG OID640595127 
ProductFHA domain-containing protein 
Protein accessionYP_001275863 
Protein GI148655658 
COG category[T] Signal transduction mechanisms 
COG ID[COG1716] FOG: FHA domain 
TIGRFAM ID[TIGR02806] clostripain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00810345 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000340593 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCAGCAT ACCTTGTTGC GGTCAGCGGC GCGCAAGCCG GGAAGCAGTT TCCGCTCACC 
GACGCTCCAT GTTCGTTTGG ACGCAACCCG GATAACGCGA TTGTGGTTGC CAGTGCACGC
GCATCGCGTC GCCATGCCGA GATCCGCCGC GAAGGGGGAG ACTTTATTCT CTACGATCTG
GGGAGCGCAA ACGGCACGCT GGTCAATGGT CAACGCATCG CCGCACCGCA CCGGCTCCGC
AGTGGCGACC TGATCGAGAT CGGCGATGAG ACCTTTCGCT TTGAGCAACC GCAACCTGCG
GTCGATGCAA CGCTTATCGC GTCACCCGGT CCCATCTCTG CTCCACCAAC CGCACCGGCA
ACACCACCGC AACCGGCGCA GGGATTTCGG TTGCCCCCAT CGCAACCGCC CTCCGCACCA
CCTCCTTCCC AGACGCCGCC TGCGCCGCCA CCGTACCAGC CACCCCCCGC GCAACCGCAG
GGGTTTCAGG TTCCGCCCGC ACCACCGTCG TACCAGCCCC CCCCCGCGCA ACCGCAGGGA
TTTCAGGTTC CACCCGCAAC CCCGTCGTAC CAGCCCCCCC CCGCGCAACC TCCATCGGCA
GCGCCGCCTG CGCGCAAAGG AGGCATGCCA CGATGGCTCA TTCCGGTAGC GCTCATTCTC
GTTGTGCTGG CAGTGGCGTG CGTCGGCAGC GCCGTCGTTG TGACGCGCGG GATCGATGGG
TTGATCCAGA ACACAACGCC GCAGAGCGGT GCGACGACCA GCGTTCCATC CTCGCCTACC
CCGCAAAACA CAGGTGGAAG CACCCCCGTG CCGCCACCCG TAACTCCGGT TGCACAGCCG
ACCGGCGACA GAGCCGCCTG GACCGTGCTC GTCTACCTGG ATGGCGACAA CAATCTGGAG
AGTGACGCCG TCATCGATTT CAACGAAATG GAACTCGTCG GCTCGACCGA TCAGGTCAGG
ATCGTCGTGC AGTTCGACCG CATCGGCGCA GCCGCCCCCT GGGACGATAC GTCCAACGGC
GACTGGGAGA CAACCAAGCG TTTCCTGGTC GAGCGTGATG ACGACCCGGA TACTATTCGC
TCACGCGAAG TGGAGGACCT GGGTGAGTTG AATATGGGCG ATCCGCAGAC CCTGGTCGAC
TTTGCAGTCT GGGGGATGCA AACCTATCCC GCCGAGCGCT ATGCGCTCAT TCTGTGGGAT
CATGGCGCAT CGTGGGCAGG GATCGCGTTC GATGACACCG ACGGCAAGGA TGGGATCAAT
ATGCCAGAAC TCGACGCGGC GTTGCGCACC ATTCAGCAGC AGACGGGGCA GCGGATCGAC
CTGATCGGCT TTGATGCCTG CCTGATGGCG CAGATCGATG TTGCGCTGGT TGTGGCGCCG
TATGCCGATG TGTTTGTGGC ATCCGCCGAA CTCGAACCCA ACACCGGTTG GGCATGGGAC
CTTCTGCTGC GCCGTCTGGT CGAAAATCCG CAGCAGGACG CGGCAACGTT CGGCGCGGGA
ATTGTGGAAT CCTACCGTGA GTTCTATGAG AGGCGCGACG ATCCGACCGT GACGCTCTCG
GCGTTCGACC TGACCCGCGC CAACGACCTG CGTCAGAAGT TGAACGCTCT CTCGGATGCC
ATGCTGAAAG GAATGGGCGA TTCGTACACT GCGATTGCCG AGGCGCGATC ATTCGTCGAT
GTGTACAGTC AGCCTGCGCC TGAAGAGTTC AGCGCCGTCG ATCTGGGACA TTTCGCCCGT
CTGGTTGTTG ATCGCGGAGC GCGTCCAGCG GTGGCGGATC CGGCGCGCGC GTTGTTTGAG
GCAATCGATC AGGCGCGTAT CGCTGAGTGG AATGGCGGAT TCCACGCCAA CTCGACCGGA
TTGTCGATCT TCTTCCCGCA GTATGCCCAG CTCTATCCGC CAATCTACGG GCAGGGATCG
CCGTTGCCAC AACAGACGTC GTGGGGCGAT TTCCTGAACG CATTCCACAC TGCCAACACC
GGTCTGCGAA GCGCTGCCGA GATCAGTTCG CTCTCTGTGT CGAACACGAC CGTCAACATT
GACAACCCGA TCACCGTTGA GGGGGTTGTG ACCGGCGACA ACATTGCCTA CGTGTTCTTC
TTTGCAGGCA TTCCCAACAA TGATCGGAGC GGTGTGTTGT TGACCAGCAT CGACTTCCTG
TACCCGCCGG GAAGTGTGCC GGGCAGCAGT ATTCCCCCCT GGACGGGCGG CAGCACGCGC
CTCAGGGTCA ACTACAGCGG CGCACAGTGG GGGTTGACCA ATGGACGGGA CACTATCCCG
GTGCTGTTGG GACCGGCGAA GTATGGCACT GCGCTCTATG GCGTCGAAGG GATCTATACC
GTTCAACGCA CGAGGGAGCG GATCGCCGCA GCGCTGGTGT TTACCGTTGA AAGCGGAGCG
CCAGAGTTGA TCACTATTTA CGGGTTCCCG AAGAATCAGA AACAGGAAGC GCAACCGTTC
GAGATTGTGC CGACGCCGGG TGATACGTTT ACCGCAGTGA TCCGCACATA TACGGTGAAG
GGTGATCGTC TGGAACCGGG ATTTGTCGAA GGAGACACCC TGACGATCGG CAACCAACCG
CTCGCCGTGC AGCGCATTCC GTTACCGGCT GGCGCGTATG TTGCCGGTTT TCTGGTGCGT
GATATTGCCG GACGCTTCAG TTATCAGTAT GCGGATATTA ATGTCAGCGC CGCCGGTTCT
GGCGTCAACG TGCCATCGCC GGTGCAGACG TCGCCTGGAA CCCAGTCCGG GTATCAGTTG
TACAACAATC CGCAACTCGG CTTCTCAATC GAATATCCGA CAGACTGGGC GACACTCGAC
ACCGGCAATG ATCATATCTA CTTCTACGAT CCGGCGGCAA ACGGCAACGT CTTCGTCAGC
GTCGATGTCT ATCAGACGGG TCTGCCGGTT AACGAAGCCA ACCAGCGATT GCTGGAGTTC
TTCCTGGATT CGCTCCAGAC CCAGCAGAAC TTCCAGCAGG CGCCCGGCGA TCCGCTGCGG
CTTGGCGGCG AGGTCGGACC GTCGGCGCGC TACCAGTACA CCGATCAGAA CGGCGTGACC
TTCAGCGGGA TCGTCATCGC AGTAACCAGT CCACGCACCG GCTTGAGTTA TCTGGTATCG
GTTCAGGCGC CGTCCGCCGA TTTCAGTCGC TACGCTGATA CGCTGGCGGC AATCGTCGAC
TCGATGAAGA TCAAATAG
 
Protein sequence
MPAYLVAVSG AQAGKQFPLT DAPCSFGRNP DNAIVVASAR ASRRHAEIRR EGGDFILYDL 
GSANGTLVNG QRIAAPHRLR SGDLIEIGDE TFRFEQPQPA VDATLIASPG PISAPPTAPA
TPPQPAQGFR LPPSQPPSAP PPSQTPPAPP PYQPPPAQPQ GFQVPPAPPS YQPPPAQPQG
FQVPPATPSY QPPPAQPPSA APPARKGGMP RWLIPVALIL VVLAVACVGS AVVVTRGIDG
LIQNTTPQSG ATTSVPSSPT PQNTGGSTPV PPPVTPVAQP TGDRAAWTVL VYLDGDNNLE
SDAVIDFNEM ELVGSTDQVR IVVQFDRIGA AAPWDDTSNG DWETTKRFLV ERDDDPDTIR
SREVEDLGEL NMGDPQTLVD FAVWGMQTYP AERYALILWD HGASWAGIAF DDTDGKDGIN
MPELDAALRT IQQQTGQRID LIGFDACLMA QIDVALVVAP YADVFVASAE LEPNTGWAWD
LLLRRLVENP QQDAATFGAG IVESYREFYE RRDDPTVTLS AFDLTRANDL RQKLNALSDA
MLKGMGDSYT AIAEARSFVD VYSQPAPEEF SAVDLGHFAR LVVDRGARPA VADPARALFE
AIDQARIAEW NGGFHANSTG LSIFFPQYAQ LYPPIYGQGS PLPQQTSWGD FLNAFHTANT
GLRSAAEISS LSVSNTTVNI DNPITVEGVV TGDNIAYVFF FAGIPNNDRS GVLLTSIDFL
YPPGSVPGSS IPPWTGGSTR LRVNYSGAQW GLTNGRDTIP VLLGPAKYGT ALYGVEGIYT
VQRTRERIAA ALVFTVESGA PELITIYGFP KNQKQEAQPF EIVPTPGDTF TAVIRTYTVK
GDRLEPGFVE GDTLTIGNQP LAVQRIPLPA GAYVAGFLVR DIAGRFSYQY ADINVSAAGS
GVNVPSPVQT SPGTQSGYQL YNNPQLGFSI EYPTDWATLD TGNDHIYFYD PAANGNVFVS
VDVYQTGLPV NEANQRLLEF FLDSLQTQQN FQQAPGDPLR LGGEVGPSAR YQYTDQNGVT
FSGIVIAVTS PRTGLSYLVS VQAPSADFSR YADTLAAIVD SMKIK