Gene Information |
Locus tag | Rcas_2971 |
Symbol | |
ID | 5540462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3854884 |
End bp | 3856116 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640895090 |
Product | cysteine desulfurase family protein |
Protein accession | YP_001433048 |
Protein GI | 156742919 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily |
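The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3854884
END_BP = 3856116


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1233, matches "Gene Length"
print(expected_protein_length(length_bp))  # 410, matches "Protein Length"
```

Under these assumptions, 1233 bp corresponds to 411 codons, one of which is the stop codon, leaving the 410 residues reported for the protein.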
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000129059 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000905656 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGCCC TCGATCTGAC CTGGATTCGT GCTCAGTTTC CTGCCTTGAT GCAGGAAATG AACGGTCGTC CCGTCGTGTT TTTCGACGGT CCTGGAGGAA CGCAGGTTCC CCGGCGGGTG ATTGACGCAA TGGCGGAGTA TCTGACTCTG TATAACTCGA ATACGCATGG CGCTTTTGCG ACCAGCCAGC GTACCGATGC AACGGTTGAC GCCGCGCGTG TTGCCATGGC TGATTTTCTG GGATGCGACG CTGATGAGGT GGTCTTTGGT CCGAACATGA CCACGCTGAC CTTTGCGATC AGCCGGGCAT TCGGGCGCGA TATCCGCCCC GGCGACGAGA TTGTGGTCAC CCGCCTGGAT CACGATGCGA ACGTGGCGCC CTGGCAGGCG CTCGAAGAAC GTGGTGCGAT CATTCGCATG GTCGATATCG ATGTGGAAGA TTGCACGCTC GACATGGCGG ACATGGCGCG CGCGATCAAT TCCCGCACGA AGCTGGTTGC AGTCGGATAT GCATCGAATG CGGTTGGCAC AATCAACGAT GTGGCCACCA TCACGCGGAT GGCACACGAC GTCGGTGCGC TGGTGTACAT CGATGCCGTC CACTACGCCC CCCATGGTCC TATCGATGTG CGCGCGCTCG ACTGTGATTT TCTGGCATGC TCACCCTACA AGTTTTTCGC ACCGCATATG GGAGCGCTTT ACGGCAAGCG CGAGCATCTG GAACGTCTGC GTCCGTACAA AGTGCGTCCC GCTTCTGATG CGGTTCCCGA CCGCTGGGAG ACCGGCACGA AAAACCACGA AGGGCTGGCG GGGGTCACGG CGGCAATCGA CTATCTGGCG GAACTGGGTC GGCGGGTGAA ACCGACGACG ACGCGGCGCG CGGCGCTTGT GCAGGCGATG GAGGCTATTC AGGCATATGA GCGCACTCTC TCACACCATC TGATCGCCGG TCTGCTCGCC ATACCAGGAT TGACATTCTA CGGCATCAGC GATCCGGCGC GCTTTGCATG GCGCACACCA ACCGTCGCGG TGCGTCTGGA GGGGAGCACT CCGCGCGAAC TTGCCAGGCG CCTGGGCGAT CAGGGTATTT TCTGCTGGGA CGGCAACTAC TATGCGATCA ATCTGACAGA GCGCCTCGGC GTCGAAGCAG ACGGCGGCAT GCTACGGATT GGACTGGTGC ACTACAATAC CGCAGAGGAG ATCGATCGGT TGCTGGAGGT GATGAGGGGT TAG
|
Protein sequence | MSALDLTWIR AQFPALMQEM NGRPVVFFDG PGGTQVPRRV IDAMAEYLTL YNSNTHGAFA TSQRTDATVD AARVAMADFL GCDADEVVFG PNMTTLTFAI SRAFGRDIRP GDEIVVTRLD HDANVAPWQA LEERGAIIRM VDIDVEDCTL DMADMARAIN SRTKLVAVGY ASNAVGTIND VATITRMAHD VGALVYIDAV HYAPHGPIDV RALDCDFLAC SPYKFFAPHM GALYGKREHL ERLRPYKVRP ASDAVPDRWE TGTKNHEGLA GVTAAIDYLA ELGRRVKPTT TRRAALVQAM EAIQAYERTL SHHLIAGLLA IPGLTFYGIS DPARFAWRTP TVAVRLEGST PRELARRLGD QGIFCWDGNY YAINLTERLG VEADGGMLRI GLVHYNTAEE IDRLLEVMRG
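The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG), and the helper name `translate` is hypothetical. Only the first 30 and last 15 nucleotides of the gene are used to keep the example short.

```python
# Codon table in the conventional TCAG ordering (standard amino acid
# assignments, assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups used in the record above
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGAGCGCCC TCGATCTGAC CTGGATTCGT"))  # MSALDLTWIR

# Last five codons of the gene sequence, ending in the TAG stop codon.
print(translate("GTGATGAGGGGTTAG"))  # VMRG
```

The first fragment reproduces the opening ten residues of the protein sequence, and the second reproduces its final four residues before the stop codon.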