Gene RoseRS_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4020 
Symbol 
ID5211003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5029577 
End bp5030902 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content56% 
IMG OID640597609 
Productzinc finger SWIM domain-containing protein 
Protein accessionYP_001278315 
Protein GI148658110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCA TCACTGCCGA AGAAGCCAAA GCCCTTGCCC CTGACCTGGC TTCGCTTAAA 
GCCGCTCAGG AACTGGCAGA TATTTGTCAT TGGGTCAGCC TGGGCGCCAA CGAAGCAGCC
TTGTGGGGTG AATGCAAGGG CAGTGCACAA AAGCCATACA AGGTGCAGGT AGATCTGTCC
AATCGCGGCT TTGCCTGCAC CTGTCCAAGT CGCAAATCTC CCTGTAAACA TGTCCTGGGA
TTGATGCTGC TGGCATCGGC TTCGCCAACC ATATTGAAAG ATGCCACACC CCCTGCCTGG
GTCTCTGAAT GGCTGGAAAA ACGCACCGAC CAGACGTCGG ATCACGCGGG ACAGGCTGAA
CCCGACACTG CTTCGTCGCA GTCGCGTCAG AAAGACGCTG CACGCCGCGC CGCCAGACGT
GAAAAACTGG TTGCTGCGGG TCTCGAAACG CTCGACCTCT GGCTCAAAGA CCTGATCCGC
CAGGGCCTGG CTTTCGCCCA GAGTGCGCCG GCTTCGTTTT GGGAACAGCA ATCTGCCCGC
CTGGTAGATG CCCAATTGCC TGGTGCGGCG CGGATGGTAC GCGAGATGCG GGACATTCCC
GGCGCCTCCC CCAACTGGAC GGAAATTCTG CTCCTGAAAA TGGCGCGACT CCACCTGCTC
ATCCAGGCCT ACCGTCGTCT GGAAAGCCTG CCCGAACCCT CACGCACCGA TGTGCGCACC
CTGCTCGGCT GGACTATCAA TCGCAAGGAG TTGATCCTCT CTTCTCCCGC CCTGAGCGAT
GACTGGCTGG TAGTGGCTCA GACCCTCGAG GAAGACGAGA CAAGTGGGTT GCGCACCCAA
ATCAACTGGT TGTGGGGCAA AACCAGCCGC AAACCTGCTC AACTTATCCT CTTCGCTTTC
AGAACCAGAC CCTTCGAAGA TCACCTTTTC CCTGGCCTGA CCTTGCGCGG TGATCTGGTT
TACTTCCCCA GCGCATACCC CTTGCGAGCC GTTTTCAAAA GCTACCGGAG GCTGGAATCC
ACTTTTGTTC CTGCTGGCTT GCCGAATTTT CTCGCCTTTC TCGATGCGTA TTCTACCGCG
CTCGGCCTCA ACCCATGGTT AGAGCATTTT CCTGTCGTGT TGGAACATGT GACCATTGAA
AGGTTGGAAA CAAACTGGCT TTTGTGTGAT GGTGAGAATC AGGCAATCCC CGTATCCTCC
CGTTCCCACT GTTGGGAACT TCTTTCTCTT TCTGGCGGTC ACCCTCTCAC CGTTTTTGGC
TTGTGGGACG GATTCTCATT TTTCCCCATG ACGGCCTGGG AGAACGAAAG GTTTGTTCGC
CTATGA
 
Protein sequence
MSSITAEEAK ALAPDLASLK AAQELADICH WVSLGANEAA LWGECKGSAQ KPYKVQVDLS 
NRGFACTCPS RKSPCKHVLG LMLLASASPT ILKDATPPAW VSEWLEKRTD QTSDHAGQAE
PDTASSQSRQ KDAARRAARR EKLVAAGLET LDLWLKDLIR QGLAFAQSAP ASFWEQQSAR
LVDAQLPGAA RMVREMRDIP GASPNWTEIL LLKMARLHLL IQAYRRLESL PEPSRTDVRT
LLGWTINRKE LILSSPALSD DWLVVAQTLE EDETSGLRTQ INWLWGKTSR KPAQLILFAF
RTRPFEDHLF PGLTLRGDLV YFPSAYPLRA VFKSYRRLES TFVPAGLPNF LAFLDAYSTA
LGLNPWLEHF PVVLEHVTIE RLETNWLLCD GENQAIPVSS RSHCWELLSL SGGHPLTVFG
LWDGFSFFPM TAWENERFVR L