Gene RoseRS_4141 details


Gene Information

Locus tag: RoseRS_4141
Symbol:
ID: 5211125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5186956
End bp: 5188386
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 59%
IMG OID: 640597730
Product: polysulphide reductase, NrfD
Protein accession: YP_001278435
Protein GI: 148658230
COG category: [C] Energy production and conversion
COG ID: [COG5557] Polysulphide reductase
TIGRFAM ID:
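
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(5186956, 5188386)
print(length)                     # 1431
print(protein_length_aa(length))  # 476
```

Both results agree with the Gene Length and Protein Length fields of the record.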


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCAC AACCCGCACA GAAATCGGCG TATGGCAAGA TGCTTGAGGA GTTGCTGGGA 
CCGAAGCAGA GTTATGAATC GGTCACCAGA ACCATTGGCG ACATTGTGCT GACGCCGCTC
AAGCGAACCC CGTGGGGTTG GGCGTTGGGG TTCGTCCTTG CAGCCCTGGG ATTGCTGATG
TACCTCTACT CGCTGGCGGT GCTGTTTACC GTCGGCGTTG GCATCTGGGG GATTAATATC
CCGGTTGCCT GGGGCTTCGA TATTATCAAC TTCGTCTGGT GGATCGGCAT CGGGCACGCC
GGGACGCTCA TCTCGGCGAT TCTGCTCCTC TTCCGGCAGG ACTGGCGCAC CTCGATCAAC
CGTGCTGCCG AGGCGATGAC GATCTTCGCC GTTGCGTGCG CCGGTATCTA CCCGCTGGTG
CATACGGGCC GCCCCTGGCT CGATTACTGG ATGCTCCCCT ATCCTGGCAC GCTCGGTATG
TGGCCGCAGT TCCGCAGCGC TCTGGAATGG GACGTGTTTG CGATCTCGAC GTATGCCACG
GTCTCAATCC TGTTCTGGTA TGTCGGTCTC ATTCCCGACC TTGCTTCGCT GCGCGATCGG
GCGACGAATA AGTGGGTCAA GATCTTCTAT GGCTTCCTGG CGCTCGGCTG GCGCGGCGGC
GCCCGCGACT GGCATCGCTA TGAGATGGCG TCGCTCATTC TGGCAGGGCT TTCGACACCG
CTGGTGCTGT CGGTGCACAG TATCATCAGC CTGGACTTCG CCATCTCACA GTTGCCCGGC
TGGCACGTGA CGGTCTTCCC GCCCTACTTC GTTGCCGGTG CAGTCTACTG CGGCTTCGCA
ATGGTGATCC TGCTGCTGAT ACCAATGCGC CGTTGGTACA AACTGCACGA TCTGATCACG
ATGAAGCACT TCGACCTGAT GGGCAAGGTG ATGCTGGCGT CAGGTCTGGT GGTGGCGTAT
GGCTATTTCG GTGAAATGTT CTATGCCTGG TACAGCGCCA ATATCTACGA GTACTTCCTG
ATCACGAACC GCACGATGGG TCCGTACGCC TGGAGTTACT GGGCGCTGAT CGTGCTGAAT
GTCGCCATTC CGCAACTGTT GTGGTTCAAG CGCTTCCGCG TCAGCCTGCC CTGGCTCTTC
TTCATCTCGA TCTGTATCAA TATCGGGATG TGGTTCGAGC GCTGGGTGAT CATCGTGCTT
AGCCTGCACC GCGACTTTAT GCCAGCGTCG TGGGGCTACT ACACGCCGAG TGTGTGGGAT
ATCTCACTGT ACGCCGGTTC GTTCGGATGG TTCTTCTTCC TGTTCTTCCT GTTCATCCGC
TTGTTGCCGG CGATCTCGAT CTTCGAGGTG CGCGACCTGG TGCATAAGAT CGAGGCAGAA
CAGCACGCGC CGGTCCAGGT CGGCGGCGCC GGACACGTCA GGGAGGCGTA G
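
A GC content figure such as the one in the Gene Information block can be recomputed from a gene sequence written in this 10-base, space-separated layout. The sketch below is one plain way to do it in Python; `gc_percent` is a hypothetical helper, and it assumes that only the characters A, C, G and T should be counted, so spaces and line breaks are ignored. How the database itself rounds or computes its value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence above.
print(gc_percent("ATGGCTTCAC AACCCGCACA"))  # 55.0
```

Passing the full gene sequence as one string (line breaks included) would give the value for the whole gene.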
 
Protein sequence
MASQPAQKSA YGKMLEELLG PKQSYESVTR TIGDIVLTPL KRTPWGWALG FVLAALGLLM 
YLYSLAVLFT VGVGIWGINI PVAWGFDIIN FVWWIGIGHA GTLISAILLL FRQDWRTSIN
RAAEAMTIFA VACAGIYPLV HTGRPWLDYW MLPYPGTLGM WPQFRSALEW DVFAISTYAT
VSILFWYVGL IPDLASLRDR ATNKWVKIFY GFLALGWRGG ARDWHRYEMA SLILAGLSTP
LVLSVHSIIS LDFAISQLPG WHVTVFPPYF VAGAVYCGFA MVILLLIPMR RWYKLHDLIT
MKHFDLMGKV MLASGLVVAY GYFGEMFYAW YSANIYEYFL ITNRTMGPYA WSYWALIVLN
VAIPQLLWFK RFRVSLPWLF FISICINIGM WFERWVIIVL SLHRDFMPAS WGYYTPSVWD
ISLYAGSFGW FFFLFFLFIR LLPAISIFEV RDLVHKIEAE QHAPVQVGGA GHVREA
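
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this using the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG). `translate_cds` and `CODON_TABLE` are hypothetical names for illustration; the sketch assumes an unspliced, in-frame sequence containing only A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove spaces and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("ATGGCTTCAC AACCCGCACA GAAATCGGCG"))  # MASQPAQKSA
```

The output matches the first ten residues of the protein sequence listed above.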