Gene Rcas_1537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1537 
Symbol 
ID5539013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1961008 
End bp1962699 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content62% 
IMG OID640893675 
Productlight-independent protochlorophyllide reductase subunit B 
Protein accessionYP_001431648 
Protein GI156741519 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01278] light-independent protochlorophyllide reductase, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.61403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000999524 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACTGG CTCTCTGGAT GTATCAGGGC ACTGCCCATC ACGGTGTCGG ACGGATCGCC 
AACAGTATGC GCGGGGTGCA TGCGGTCTTT CATGCGCCGC AGGGCGACGA CTACGTCAAT
CCGATCTTCA CGATGCTGGA GCGCACCCCC GACTTTCCGC GCATGACGAC CAGCATCGTC
AGCGGGCGCG ACCTGGCGCA GGGAACCGTG CGATTGCCCG AAACGCTCCG GCAGGTCGAT
GCGCAGGTGC AACCGGATCT GATTATCGTC TGCGCCAGTT GCTCGACCAT CCTCTTGCAG
GAGGACCTGG AGCGGATGGC GCATAGCGCC GGAACGCGCG CCGAGACGCT GGTGTACGAT
GCCAATCCCT ATCGTATGCA GGAAGTTCGC TCCGCCGACG GGTTGTTCAC TCTTCTGACA
CAACGCTTTG CCCGTTCTCA ACCGCCGACG GCAGTCCCAA GCGTCAATAT TCTCGGTCCG
GCATCGCTCG GCTTCCACAA TCGCAGCGAT CTGATCTGCC TGCGGCGGAT GCTGGCGACC
CTCGGCGTGC AGGTGAATGT CGTCGCGCCC CTTGGCGCAT CGATCCGCGA CCTGGAGCGC
CTTCCGGCAG CCTGGGCAAC GATCGCGCCC TACCGCGAAC TGGGACAAAA CGCCGCGCGC
TGGCTCGACG AGCAGTTTGG CGTTCCGGCG CTCACCGATT CGCCTATCGG CGTGCAACCG
ACCCTGCGCT GGCTACGGCG TCTGGTCGAG ACCCTCAACG ATGCCGGTGA GCGGTTGCAG
CGCCTGACAA ACCCGTTGCG CCTGCCGCCG TTGACTGCCT TCTCGCTCGA TGGCATGAGT
GCGCCGAGTT CGGTCCCCTG GTTTGCGCGC ACTGCCGATA TGGAAAGTTA CAGCATGAAG
CGCGCCTTCG TCTTTGGCGA TGCCACTCAT ACCGTCGGTA TGGTGAAGTT CCTGCGCGAT
GAACTGGGCA TGCAGATCGT TGGGGCGGGG ACCTATCTGG AGCACGAAGC GGACTGGGTG
CGTGGGGAAC TTCAGGACTA CCTGCCCGCC GACGAGACCG GATCGATAGA CACTTCGTTT
CTGGTGACCG AGGTGTTTCA GGATGTCGCG CGGCGTATCG CCGATCTCAC GCCCGAACTG
GTCTGTGGCA CCCAGATGGA ACGCCACGCC TGCCGCAAAC TCGACCTTCC GTGCATGGTC
ATTGCCCCGC CGACGCATAT CGAAAATCAT CTTCTGAGTT ACCGACCGGT GCTTGGCTTT
GATGGCGCCG ATGTGCTGGC GGATACTGTC TACACCACGG CGACACTGGG CATGGAGAAA
CATCTGATCG ACATGTTTGG TGATGCCGGG CTGGAGTACG AGGAACCGAG AACTGAGCGC
CGAGAAGCGG AATTCGGGAA CCAGAAGGTG GAAACTGGTG AACCAGGAAC CGGAGCGCCG
GTGATTGCCC ACGCGGATTC GAACGGTGGC GTTGCCGGTT CGTCGAGCAC TCTCGCTGCT
CAGACGGTCA CAGCATCACC CCGGCTCGTC ACGCCGGTCT GGGCGCCGGA GGCGCAGGCG
ATGCTCAAGA AGGTGCCGTT TTTCGTGCGA GGACGGGTGC AGAAGAATGT CGAACGGTAC
GCAGCGCAAC ACGGGTATGC AACGATAACC GCCGAGATAC TGGTGGAAGC GAAAGAGGCG
CTTGGCGGTT GA
 
Protein sequence
MRLALWMYQG TAHHGVGRIA NSMRGVHAVF HAPQGDDYVN PIFTMLERTP DFPRMTTSIV 
SGRDLAQGTV RLPETLRQVD AQVQPDLIIV CASCSTILLQ EDLERMAHSA GTRAETLVYD
ANPYRMQEVR SADGLFTLLT QRFARSQPPT AVPSVNILGP ASLGFHNRSD LICLRRMLAT
LGVQVNVVAP LGASIRDLER LPAAWATIAP YRELGQNAAR WLDEQFGVPA LTDSPIGVQP
TLRWLRRLVE TLNDAGERLQ RLTNPLRLPP LTAFSLDGMS APSSVPWFAR TADMESYSMK
RAFVFGDATH TVGMVKFLRD ELGMQIVGAG TYLEHEADWV RGELQDYLPA DETGSIDTSF
LVTEVFQDVA RRIADLTPEL VCGTQMERHA CRKLDLPCMV IAPPTHIENH LLSYRPVLGF
DGADVLADTV YTTATLGMEK HLIDMFGDAG LEYEEPRTER REAEFGNQKV ETGEPGTGAP
VIAHADSNGG VAGSSSTLAA QTVTASPRLV TPVWAPEAQA MLKKVPFFVR GRVQKNVERY
AAQHGYATIT AEILVEAKEA LGG