Gene Rcas_3754 details

Gene Information

Locus tag: Rcas_3754
Symbol:
ID: 5541256
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4924518
End bp: 4926443
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 60%
IMG OID: 640895865
Product: bifunctional photosynthetic reaction center subunit L/M
Protein accession: YP_001433812
Protein GI: 156743683
COG category:
COG ID:
TIGRFAM ID: [TIGR01115] photosynthetic reaction center M subunit
            [TIGR01157] photosynthetic reaction center L subunit
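
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are inclusive, and that the CDS is one codon per residue plus a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4924518
end_bp = 4926443
gene_length_bp = 1926
protein_length_aa = 641

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print("span matches gene length:", span_bp == gene_length_bp)
print("protein length matches gene length:", expected_cds_bp == gene_length_bp)
```

Both checks agree with the listed gene length of 1926 bp.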


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0347506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.005487
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCGCTG TGCCACGCGC GCTGCCGCTG CCGAGTGGAG AGACTCTCCC CGCTGAGGCG 
ATTTCCTCCA CAGGCTCGCA GGCAGCATCC GCAGAGGTTA TCCCATTCTC GATTATCGAG
GAGTTCTATA AGCGGCCCGG AAAAACCCTC GCGGCGCGCT TCTTCGGCGT CGATCCGTTC
GATTTCTGGA TTGGCCGCTT CTATGTTGGG TTGTTCGGCG CCATCTCGAT CATCGGCATC
ATTCTTGGGG TGGCGTTCTA CCTGTACGAG GGGGTGGTCA ACGAGGGAAC GCTCAACATT
CTGGCGATGC GCATCGAGCC GCCGCCAGTG TCGCAGGGCT TGAACGTCGA TCCTGCGCAG
CCCGGCTTCT TCTGGTTCCT GACGATGGTT GCAGCGACCA TCGCCTTTGT CGGTTGGTTG
CTGCGGCAGA TCGATATCAG CCTGAAGCTT GATATGGGCA TGGAGGTGCC GATTGCCTTC
GGCGCGGTTG TGTCCTCCTG GATCACGCTT CAGTGGTTGC GCCCGATTGC GATGGGCGCC
TGGGGGCATG GCTTCCCGCT CGGCATTACC CACCACCTCG ATTGGGTGTC GAATATCGGC
TATCAGTATT ACAATTTCTT TTATAATCCG TTCCACGCCA TCGGCATTAC GCTGCTCTTC
GCCTCAACGC TGTTCCTCCA TATGCACGGG TCGGCGGTGC TGAGCGAGGC GAAACGCAAC
ATCTCCGATC AGAACATCCA CGTCTTCTGG CGTAACATCC TTGGCTACAG CATCGGCGAG
ATCGGCATCC ACCGCGTTGC GTTCTGGACG GGCGCAGCAT CGGTGCTCTT CTCGAACCTG
TGCATCTTCC TGTCCGGCAC GTTCGTGAAG GATTGGAACG CTTTCTGGGG CTTTTGGGAC
AAGATGCCGA TCTGGAATGG CGTCGGTCAG GGAGCGCTGG TGGCCGGTCT GTCGCTGTTG
GGCGTCGGGC TGGTGCTGGG GCGCGGTCGT GAGACGCCGG GTCCGATCGA TCTGCACGAC
GAGGAGTATC GCGACGGTCT TGAAGGGACG ATTGCCAAGC CGCCGGGTCA TGTCGGCTGG
ATGCAGCGTC TGCTGGGTGA AGGACAGGTG GGTCCGATCT ATGTCGGGTT GTGGGGCGTC
ATTTCGTTCA TCACCTTCTT CGCCAGTGCG TTCATCATTC TGGTAGATTA TGGCCGTCAG
GTGGGGTGGA ACCCGATCAT CTATCTGCGC GAGTTTTGGA ACCTGGCCGT CTATCCACCG
CCGACCGAGT ATGGTCTGAG CTGGAATGTG CCATGGGACA AGGGCGGCGC ATGGCTGGCG
GCGACGTTCT TCCTGCACAT TTCGGTGCTG ACGTGGTGGG CGCGCCTCTA TACCCGCGCG
AAAGCGACCG GCGTCGGCAC GCAGCTGGCG TGGGGGTTCG CTTCGGCGCT GTCGCTCTAC
TTCGTGATCT ATCTGTTCCA TCCGCTGGCG CTCGGTAACT GGAGCGCCGC GCCGGGCCAC
GGCTTCCGCG CGATCCTGGA CTGGACGAAC TATGTGAGCA TCCACTGGGG CAACTTCTAC
TACAACCCCT TCCATATGCT CTCGATCTTC TTCCTGCTCG GATCGACGCT GTTGCTGGCG
ATGCATGGGG CGACGATCGT CGCAACCTCG AAGTGGAAGT CGGAGATGGA GTTCACCGAG
ATGATGGCGG AAGGTCCCGG AACGCAGCGC GCGCAACTCT TCTGGCGCTG GGTGATGGGC
TGGAACGCCA ACTCGTACAA CATTCACATC TGGGCATGGT GGTTCGCGGC GTTCACCGCG
ATTACCGGCG CGATTGGCCT GTTCTTGAGC GGTACGCTCG TGCCCGACTG GTATGCCTGG
GGTGAAACGG CCAAGATTGT GGCGCCCTGG CCGAACCCCG ATTGGGCGCA GTATGTCTTC
CGGTAA
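
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that clean-up plus a GC-content calculation; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a small sample.
first_line = "ATGTCCGCTG TGCCACGCGC GCTGCCGCTG CCGAGTGGAG AGACTCTCCC CGCTGAGGCG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_content(sample), 1))
```

Passing the whole sequence block to `clean_sequence` and then to `gc_content` gives a figure that can be compared with the GC content listed in the Gene Information section.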
 
Protein sequence
MSAVPRALPL PSGETLPAEA ISSTGSQAAS AEVIPFSIIE EFYKRPGKTL AARFFGVDPF 
DFWIGRFYVG LFGAISIIGI ILGVAFYLYE GVVNEGTLNI LAMRIEPPPV SQGLNVDPAQ
PGFFWFLTMV AATIAFVGWL LRQIDISLKL DMGMEVPIAF GAVVSSWITL QWLRPIAMGA
WGHGFPLGIT HHLDWVSNIG YQYYNFFYNP FHAIGITLLF ASTLFLHMHG SAVLSEAKRN
ISDQNIHVFW RNILGYSIGE IGIHRVAFWT GAASVLFSNL CIFLSGTFVK DWNAFWGFWD
KMPIWNGVGQ GALVAGLSLL GVGLVLGRGR ETPGPIDLHD EEYRDGLEGT IAKPPGHVGW
MQRLLGEGQV GPIYVGLWGV ISFITFFASA FIILVDYGRQ VGWNPIIYLR EFWNLAVYPP
PTEYGLSWNV PWDKGGAWLA ATFFLHISVL TWWARLYTRA KATGVGTQLA WGFASALSLY
FVIYLFHPLA LGNWSAAPGH GFRAILDWTN YVSIHWGNFY YNPFHMLSIF FLLGSTLLLA
MHGATIVATS KWKSEMEFTE MMAEGPGTQR AQLFWRWVMG WNANSYNIHI WAWWFAAFTA
ITGAIGLFLS GTLVPDWYAW GETAKIVAPW PNPDWAQYVF R
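
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons; this gene starts with ATG, so that does not matter here). `translate` is an illustrative helper, not a tool used by the source database.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the grouping spaces removed.
first_codons = "ATGTCCGCTGTGCCACGCGCGCTGCCGCTG"
print(translate(first_codons))  # MSAVPRALPL

# Last six bases of the gene sequence: one codon followed by the stop codon.
print(translate("CGGTAA"))  # R
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.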