Gene Rcas_0626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0626 
Symbol 
ID5538089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp825732 
End bp826733 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content61% 
IMG OID640892784 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_001430770 
Protein GI156740641 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.464405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.107991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGA CCCGACGTGA ACGACTCGCT GCGGCGATCC GAGGCGAACC GGTTGATCGC 
CCACCTGTTG CGCTCTGGCG CCATTTTCCG GTAGATGATC AGGACCCCGA ACAACTGGCG
TTATCCGTGG CTGCGTTTCA GTCGCAGTAC GACTGGGATT TCGTCAAGTT CACACCATCG
AGCAGTTTTT GTGTCGAGAA TTGGGGATGC CGCGTCGTGT ACCGTGGGCA CTCCGAAGGA
ACCAGCGACT ACGTCGCGCG CCCGGTCAGC GTCCCTGCCG ACTGGCGGCG CATTACGCCG
CTCGACCCGC GCGCTGGCGC ACTTGGCGCG CATCTGGTAG CCGTCCGTCG TGCGCGCGCA
TTGATCGATC CCGACGTTCC CCTGCTGGCG ACAGTCTTCA GCCCGATCAG TCAGGCAAAG
AATCTGATCG GCGGAGGGAT GGACATTGTA CATCTCCGGC GTCATCGCTC CGATCTGCTG
GACGCGCTCG AAGCAATCAC AGAAACAACG ATACGCTTCG TCGAAGCCGT ACTCGAAACC
GGCGCCGACG GCATTTTCTA CGCAATGCAA CGATGTACAG CGGATGTCAT CAGCGAAGCC
GAATACCGCG AGGTCTGCCG TCCGCTTGAC ATGCGCATTC TCGAAGCGGC GCATGCAGCC
AGCGCAGCAC ATGGAAAACC GCCTTTCATT CTGCTCCACC TGCATGGTAT GCACTCCTAC
TTCGACATTG CAGCGGAATA TCCCGCGCAG GCGCTCAACT GGCACGACCG CGACACCGGA
CCCGACCTCG CTGAAGGCGC GCGCCGCTTT CCAGGCATGG TTGTTGGAGG TTTAAGCCAA
CGCGATATTG TCGAAGGTTC ACCCACGGCA GTGCAGTCGC TGGCGCGCCA GGCAATCGCA
GCGATGGGCG GACGGCGCAT GTGCCTTTCG ACCGGCTGTG TGATGCCGAC GACGGCGCCC
TGGGGGAACA TTCGCGCACT GCGAGATGTC GTGGGTCCAT GA
 
Protein sequence
MSMTRRERLA AAIRGEPVDR PPVALWRHFP VDDQDPEQLA LSVAAFQSQY DWDFVKFTPS 
SSFCVENWGC RVVYRGHSEG TSDYVARPVS VPADWRRITP LDPRAGALGA HLVAVRRARA
LIDPDVPLLA TVFSPISQAK NLIGGGMDIV HLRRHRSDLL DALEAITETT IRFVEAVLET
GADGIFYAMQ RCTADVISEA EYREVCRPLD MRILEAAHAA SAAHGKPPFI LLHLHGMHSY
FDIAAEYPAQ ALNWHDRDTG PDLAEGARRF PGMVVGGLSQ RDIVEGSPTA VQSLARQAIA
AMGGRRMCLS TGCVMPTTAP WGNIRALRDV VGP