Gene RoseRS_3113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3113 
Symbol 
ID5210081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3910455 
End bp3911702 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content59% 
IMG OID640596704 
Productcysteine desulfurase family protein 
Protein accessionYP_001277426 
Protein GI148657221 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000139968 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00618461 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACCCCC TCGATCTGAC CTGGATTCGC GCACAGTTTC CTGCGCTGGC GCAAGAAGTG 
AATGGACATC CCGCCGTGTT TTTTGATGGT CCAGGCGGAA CGCAGGTTCC GCAGCGGGTG
ATCGATGCTG TCGCCGATTA TCTGATCCAC CACAATGCCA ATACTCATGG CGCATTTGCA
ACCAGTCGCC GCACTGATGA AACGATTGAC GCGGCGCGCG CCGCTATGGC TGATTTTCTG
GGGTGTGCTG CGGACGAGGT GGTTTTCGGA CCAAACATGA CCACGCTGAC CTTTGCGATC
AGCCGCGCAT TTGGGCGTGA CATTCGCCCC GGTGATGAGA TTGTCCTGAC GCGCCTGGAT
CATGATGCCA ACGTCGCACC CTGGAAAGCG CTCGAAGAAC AGGGCGCCGT CATTCAGATG
GTCGATATCG ACACCGAAGA ATGCACCCTC GATATGGCGG ATATGGCGCG CGCCATCGGT
CCACGCACGA AACTCGTCGC GGTCGGGTAT GCGTCGAACG CCGTGGGAAC GATCAACGAC
GTGGCGACCA TCACACGGAT GGCGCACGCG GTCGGTGCAC TGGTGTATAT CGATGCAGTG
CACTACGCCC CGCACGGACC AATCGATGTG CGGGCGCTCG ATTGCGATTT TCTCGCGTGC
TCGCCGTACA AATTCTTTGC ACCGCATATG GGAGTTTTAT ACGGCAAACG TGAGCACCTG
GCGCGCCTGC GTCCGTATAA GGTTCGACCC GCCTCTGACG ATGTTCCTGA TCGCTGGGAA
ACTGGAACCA AAAACCACGA AGGGTTAGCC GGGGTAACGG CGGCAATCGA GTACCTGGCA
GAACTTGGGC AGCGCATCAA GCCAGCGACG ACCCGACGCG CGGCGCTGGT GCAGGCGATG
GAAGCGATCA AAGCGTATGA ACGCGGATTA TCGGAGCAAC TGATCGCCGG TCTCCTTGCA
ATTCCGGGAT TGACCTTCTA CGGTATCAGC GACCCGGCGC GTTTCGACAT GCGCACGCCG
ACCGTGGCAG TGCGTCTTGC CGGACGCACA CCGCGCGAAC TTGCCGAAGC GCTGGGACGG
CGCGGCATCT TCTGCTGGGA CGGCAACTAC TACGCGATCA ATCTGACCGA GCGCCTGGGC
GTTGAAGCTG ATGGCGGCAT GCTGCGTATT GGTCTGGTGC ACTACAACAC CGTGGAAGAG
ATCGAACTAT TGCTGGAAGC GCTGAACGAA CTGAGGATCG GGAACTGA
 
Protein sequence
MHPLDLTWIR AQFPALAQEV NGHPAVFFDG PGGTQVPQRV IDAVADYLIH HNANTHGAFA 
TSRRTDETID AARAAMADFL GCAADEVVFG PNMTTLTFAI SRAFGRDIRP GDEIVLTRLD
HDANVAPWKA LEEQGAVIQM VDIDTEECTL DMADMARAIG PRTKLVAVGY ASNAVGTIND
VATITRMAHA VGALVYIDAV HYAPHGPIDV RALDCDFLAC SPYKFFAPHM GVLYGKREHL
ARLRPYKVRP ASDDVPDRWE TGTKNHEGLA GVTAAIEYLA ELGQRIKPAT TRRAALVQAM
EAIKAYERGL SEQLIAGLLA IPGLTFYGIS DPARFDMRTP TVAVRLAGRT PRELAEALGR
RGIFCWDGNY YAINLTERLG VEADGGMLRI GLVHYNTVEE IELLLEALNE LRIGN