Gene Rcas_2971 details

Gene Information

Locus tag: Rcas_2971
Symbol:
ID: 5540462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3854884
End bp: 3856116
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 60%
IMG OID: 640895090
Product: cysteine desulfurase family protein
Protein accession: YP_001433048
Protein GI: 156742919
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily
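
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3854884
END_BP = 3856116
GENE_LENGTH_BP = 1233
PROTEIN_LENGTH_AA = 410


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1233
print(coding_codons(GENE_LENGTH_BP))   # 410
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.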


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000129059
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000905656
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGCCC TCGATCTGAC CTGGATTCGT GCTCAGTTTC CTGCCTTGAT GCAGGAAATG 
AACGGTCGTC CCGTCGTGTT TTTCGACGGT CCTGGAGGAA CGCAGGTTCC CCGGCGGGTG
ATTGACGCAA TGGCGGAGTA TCTGACTCTG TATAACTCGA ATACGCATGG CGCTTTTGCG
ACCAGCCAGC GTACCGATGC AACGGTTGAC GCCGCGCGTG TTGCCATGGC TGATTTTCTG
GGATGCGACG CTGATGAGGT GGTCTTTGGT CCGAACATGA CCACGCTGAC CTTTGCGATC
AGCCGGGCAT TCGGGCGCGA TATCCGCCCC GGCGACGAGA TTGTGGTCAC CCGCCTGGAT
CACGATGCGA ACGTGGCGCC CTGGCAGGCG CTCGAAGAAC GTGGTGCGAT CATTCGCATG
GTCGATATCG ATGTGGAAGA TTGCACGCTC GACATGGCGG ACATGGCGCG CGCGATCAAT
TCCCGCACGA AGCTGGTTGC AGTCGGATAT GCATCGAATG CGGTTGGCAC AATCAACGAT
GTGGCCACCA TCACGCGGAT GGCACACGAC GTCGGTGCGC TGGTGTACAT CGATGCCGTC
CACTACGCCC CCCATGGTCC TATCGATGTG CGCGCGCTCG ACTGTGATTT TCTGGCATGC
TCACCCTACA AGTTTTTCGC ACCGCATATG GGAGCGCTTT ACGGCAAGCG CGAGCATCTG
GAACGTCTGC GTCCGTACAA AGTGCGTCCC GCTTCTGATG CGGTTCCCGA CCGCTGGGAG
ACCGGCACGA AAAACCACGA AGGGCTGGCG GGGGTCACGG CGGCAATCGA CTATCTGGCG
GAACTGGGTC GGCGGGTGAA ACCGACGACG ACGCGGCGCG CGGCGCTTGT GCAGGCGATG
GAGGCTATTC AGGCATATGA GCGCACTCTC TCACACCATC TGATCGCCGG TCTGCTCGCC
ATACCAGGAT TGACATTCTA CGGCATCAGC GATCCGGCGC GCTTTGCATG GCGCACACCA
ACCGTCGCGG TGCGTCTGGA GGGGAGCACT CCGCGCGAAC TTGCCAGGCG CCTGGGCGAT
CAGGGTATTT TCTGCTGGGA CGGCAACTAC TATGCGATCA ATCTGACAGA GCGCCTCGGC
GTCGAAGCAG ACGGCGGCAT GCTACGGATT GGACTGGTGC ACTACAATAC CGCAGAGGAG
ATCGATCGGT TGCTGGAGGT GATGAGGGGT TAG
 
Protein sequence
MSALDLTWIR AQFPALMQEM NGRPVVFFDG PGGTQVPRRV IDAMAEYLTL YNSNTHGAFA 
TSQRTDATVD AARVAMADFL GCDADEVVFG PNMTTLTFAI SRAFGRDIRP GDEIVVTRLD
HDANVAPWQA LEERGAIIRM VDIDVEDCTL DMADMARAIN SRTKLVAVGY ASNAVGTIND
VATITRMAHD VGALVYIDAV HYAPHGPIDV RALDCDFLAC SPYKFFAPHM GALYGKREHL
ERLRPYKVRP ASDAVPDRWE TGTKNHEGLA GVTAAIDYLA ELGRRVKPTT TRRAALVQAM
EAIQAYERTL SHHLIAGLLA IPGLTFYGIS DPARFAWRTP TVAVRLEGST PRELARRLGD
QGIFCWDGNY YAINLTERLG VEADGGMLRI GLVHYNTAEE IDRLLEVMRG
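
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above (illustration only).
first_blocks = "ATGAGCGCCC TCGATCTGAC"

print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.6
```

Applying `gc_fraction` to the full gene sequence would give a value to compare with the reported GC content field.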