Gene Rcas_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4179 
Symbol 
ID5541690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5408961 
End bp5410016 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content64% 
IMG OID640896290 
Productoxidoreductase molybdopterin binding 
Protein accessionYP_001434228 
Protein GI156744099 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0370615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000255221 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCGACCAC GCACAACAGA CTGGAGCCTG GCGCTGGCAA CCGGCGCTGC GTTTGCGACC 
GGTCTCTGGA CGTTAACCAC CGGGCAAGTT GAAGGATGGT GGGTGTTTGC GCTCCATGGC
GCGACCGGTT ATCTGACGTT TCTGCTTCTG ATCCCCAAAC TTGTGCGCGT TCGCAACCGT
CTGCTGCCCG GAATTCGGTC TCCCCGCGCC TGGGCTGGGC TGGCAACCAC GGCGCTGGCG
CTTCTCACGC TTGTCTGCGG CATTGCCTGG GTGAGCGGCG GCGGCATCGT TGTGCTCGGC
TACAATCTGC TTAACTGGCA TATTCTGTTC GGTCTGGTGT TGACCGTGCT TCTTTCAGCG
CACATGGTTG TTCGCGCAAA ACCGCTCCGC ACGGAAGACC GTTCGCGACG GCAGGCGCTG
CGCGCAGGGG CATTTGCGCT AGGGGCAGCG TTGATCTGGC CCCTTCAGGA GCGTCTCATT
GGTACATTGG GGCTGCCGGG AGCGCAACGG CGCTTCACCG GTTCACGTGA GGTCGCCAGT
TTCAGCGGTA ATGGATTTCC AATCGTCAGT TGGATGGCGG ATCGCCCTGC GCCGCTGGAC
GTGGCGACCT GGCGTTTGCG CGTGACCGGT CTGGTTAGCG AGTCATTCGC CGTCAGCCAC
GATGAACTCG ATGCGCGCGA TGAACTGACG GCAACGCTCG ATTGCACTGG CGGCTTCTAC
ACCACGCAGC ATTGGCGCGG CACACGGGTC GGCGCGCTGC TCGACCGCGC CGGGGTGCTG
CCGGAGGCGC GCTGGGTGCG GTTTGTGTCG GTCACAGGCT ATCGCTGGAG CCTGCCGCTG
GAACAGGCGC GCGAGACGTT GATCGCAGTG CGGGTTGGCG GCGAACCGCT CAGCCACGGG
CACGGCGCGC CTGCCCGCCT CGTCGCTCCC GGCGAACGCG GGTTTGTGTG GGTTAAGTGG
CTGGCGCTCA TCGACGTGCG CGCCGAGCCG GACCCGGCTC AATTGGTGGC GATCAATGTG
AGCGGGTTTG TTGCCTCGGA TGATGTTGGG GGATGA
 
Protein sequence
MRPRTTDWSL ALATGAAFAT GLWTLTTGQV EGWWVFALHG ATGYLTFLLL IPKLVRVRNR 
LLPGIRSPRA WAGLATTALA LLTLVCGIAW VSGGGIVVLG YNLLNWHILF GLVLTVLLSA
HMVVRAKPLR TEDRSRRQAL RAGAFALGAA LIWPLQERLI GTLGLPGAQR RFTGSREVAS
FSGNGFPIVS WMADRPAPLD VATWRLRVTG LVSESFAVSH DELDARDELT ATLDCTGGFY
TTQHWRGTRV GALLDRAGVL PEARWVRFVS VTGYRWSLPL EQARETLIAV RVGGEPLSHG
HGAPARLVAP GERGFVWVKW LALIDVRAEP DPAQLVAINV SGFVASDDVG G