Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0626 |
Symbol | |
ID | 5538089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 825732 |
End bp | 826733 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640892784 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001430770 |
Protein GI | 156740641 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.464405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.107991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGA CCCGACGTGA ACGACTCGCT GCGGCGATCC GAGGCGAACC GGTTGATCGC CCACCTGTTG CGCTCTGGCG CCATTTTCCG GTAGATGATC AGGACCCCGA ACAACTGGCG TTATCCGTGG CTGCGTTTCA GTCGCAGTAC GACTGGGATT TCGTCAAGTT CACACCATCG AGCAGTTTTT GTGTCGAGAA TTGGGGATGC CGCGTCGTGT ACCGTGGGCA CTCCGAAGGA ACCAGCGACT ACGTCGCGCG CCCGGTCAGC GTCCCTGCCG ACTGGCGGCG CATTACGCCG CTCGACCCGC GCGCTGGCGC ACTTGGCGCG CATCTGGTAG CCGTCCGTCG TGCGCGCGCA TTGATCGATC CCGACGTTCC CCTGCTGGCG ACAGTCTTCA GCCCGATCAG TCAGGCAAAG AATCTGATCG GCGGAGGGAT GGACATTGTA CATCTCCGGC GTCATCGCTC CGATCTGCTG GACGCGCTCG AAGCAATCAC AGAAACAACG ATACGCTTCG TCGAAGCCGT ACTCGAAACC GGCGCCGACG GCATTTTCTA CGCAATGCAA CGATGTACAG CGGATGTCAT CAGCGAAGCC GAATACCGCG AGGTCTGCCG TCCGCTTGAC ATGCGCATTC TCGAAGCGGC GCATGCAGCC AGCGCAGCAC ATGGAAAACC GCCTTTCATT CTGCTCCACC TGCATGGTAT GCACTCCTAC TTCGACATTG CAGCGGAATA TCCCGCGCAG GCGCTCAACT GGCACGACCG CGACACCGGA CCCGACCTCG CTGAAGGCGC GCGCCGCTTT CCAGGCATGG TTGTTGGAGG TTTAAGCCAA CGCGATATTG TCGAAGGTTC ACCCACGGCA GTGCAGTCGC TGGCGCGCCA GGCAATCGCA GCGATGGGCG GACGGCGCAT GTGCCTTTCG ACCGGCTGTG TGATGCCGAC GACGGCGCCC TGGGGGAACA TTCGCGCACT GCGAGATGTC GTGGGTCCAT GA
|
Protein sequence | MSMTRRERLA AAIRGEPVDR PPVALWRHFP VDDQDPEQLA LSVAAFQSQY DWDFVKFTPS SSFCVENWGC RVVYRGHSEG TSDYVARPVS VPADWRRITP LDPRAGALGA HLVAVRRARA LIDPDVPLLA TVFSPISQAK NLIGGGMDIV HLRRHRSDLL DALEAITETT IRFVEAVLET GADGIFYAMQ RCTADVISEA EYREVCRPLD MRILEAAHAA SAAHGKPPFI LLHLHGMHSY FDIAAEYPAQ ALNWHDRDTG PDLAEGARRF PGMVVGGLSQ RDIVEGSPTA VQSLARQAIA AMGGRRMCLS TGCVMPTTAP WGNIRALRDV VGP
|
| |