Gene RoseRS_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3938 
Symbol 
ID5210921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4927895 
End bp4929484 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content64% 
IMG OID640597533 
ProductFHA domain-containing protein 
Protein accessionYP_001278240 
Protein GI148658035 
COG category[T] Signal transduction mechanisms 
COG ID[COG1716] FOG: FHA domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCG CATCCCATCA GACGCAGTCG CTCTGTCCTG TCTGTGGCGC GCCTCAGCCG 
GGCGCCGCCC GATTCTGCTC ACGCTGCGGT CATCCGCTGA CCCAACCGCC TCAGACATTC
GAGATGCTGG TTACCCAGAA CAACGCCCCA CCGATGCGCG TACCCATCCC CGCCGCAACC
GTTACCATCG GTCGGGCGCA CGACAGCACC ATCCTGATCA GCGATCCAAA GGTTTCACGA
CGCCATCTGC AACTGACCTG GAACGGCGCG GCGTTCGTCG CTGAAGATGT CGGCAGCAGC
GGCGGGACGC TTCTGAACGG CATGCCGTTG CGCAGCCCGA CGATCCTGCG TCCGGGAGAC
ACGCTTTCGA TTGGAGATAC CATCCTGCGG TTGGAGATCG CGTCAGGCAC AGCAACCGTG
CTGGCGCCTC CGCGTGAACA GGCGCCCTCT GTGCCGCAGA CGCCGACCAC GCCCGCCCAT
CTCCAGCCAC CTGCATCTGC ATCTGCGCAA CCTGCTGCAC TGCCGCCCTA TCAACCGCCT
GCATCTGCGC AACCTGCTGC ACCGCCACCT TATCAACCAC CCGCATATGG GCAACCTATC
ACACCGCCGC CCTATCAGCC GCCTGCATCT GCACAACCTG GTGCACCGCC ATCTTATCAA
CCACCCGCAT ATGGGCAACC TGTCACACCG CCACCCTATC AATCGCCTGC GTCTGGGCAA
CCTGGTGCGC TGCCGCCCTA TCAGTCGCCT GCACCTGCAT CTGCGCAACC TGCCGCGCCG
CCTTACCAGC CGCCGTATCG TCAGGAACCG GTTGGCTTTC CGCCGCCTCC AGCGACACCT
GCCGCGCCGC CATATCCGGG AATGGCGCAG ATCCCGGTGT CGCCGCCGCC CCGTCGTTCA
TCACACATCG GCATCATCCT CGGCATTGTG GCGGTGTTTC TGCTGATCGG AGGAGGCGCA
GCCGTGGTGA TCCTCGGTCC GTGGCGCAGC CCCCTTGGAC CCGTCGCGCC CCCATCTTCT
GATCAAGAAC AGCCGGCGCC GCTGACCATG ACCCTCACGC CGCAGGAACA GACTATCGTT
GCTCCCGATG GTCAACCGCA CACCGACAGT CATGGCGCCA GCCTGATCGT GCCATCTGAC
ATCCTCGAGG AAGCGGCGCA TGTCGAACTG ATCGCAAGCA GCGCCCAGGG AACGCTCGCT
GATGCGCTGA GCCAGGACTT TACCATCGAA ACGCCTTTCT ACGCCATCGT TGCCGACAAC
GACGGGCGCG GGCGCGCGTC GCTGACACTG CCAGCCGCCA GTCCTGATTC GCGTGTCGCG
GTTGTGATCG ATACCACCTA TCTGGCGATC CTCGATACGC AGCCGGTCAA CGGCGTACTG
CACGTCGAAG CCGCTGTGAC ACCCAAAACG TTGCCCGATA CACCGGCGCC CGGCACAACG
CGCGATGGCT CGATCCACTA CGTCGTTCTC CGCGCAAAGT CTGGCAGTGC TGCATCGCCT
GCCGGCACGG GCAGCGCACT GGGGCTGAAC CTTGCGCCGC GCCGCGCCTA TGCTGCTGAA
GCGTCCCAGA CAAACACATA TATACCTTGA
 
Protein sequence
MNPASHQTQS LCPVCGAPQP GAARFCSRCG HPLTQPPQTF EMLVTQNNAP PMRVPIPAAT 
VTIGRAHDST ILISDPKVSR RHLQLTWNGA AFVAEDVGSS GGTLLNGMPL RSPTILRPGD
TLSIGDTILR LEIASGTATV LAPPREQAPS VPQTPTTPAH LQPPASASAQ PAALPPYQPP
ASAQPAAPPP YQPPAYGQPI TPPPYQPPAS AQPGAPPSYQ PPAYGQPVTP PPYQSPASGQ
PGALPPYQSP APASAQPAAP PYQPPYRQEP VGFPPPPATP AAPPYPGMAQ IPVSPPPRRS
SHIGIILGIV AVFLLIGGGA AVVILGPWRS PLGPVAPPSS DQEQPAPLTM TLTPQEQTIV
APDGQPHTDS HGASLIVPSD ILEEAAHVEL IASSAQGTLA DALSQDFTIE TPFYAIVADN
DGRGRASLTL PAASPDSRVA VVIDTTYLAI LDTQPVNGVL HVEAAVTPKT LPDTPAPGTT
RDGSIHYVVL RAKSGSAASP AGTGSALGLN LAPRRAYAAE ASQTNTYIP