Gene Rcas_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3744 
Symbol 
ID5541246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4910332 
End bp4911600 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content64% 
IMG OID640895855 
Productchlorophyllide reductase subunit Y 
Protein accessionYP_001433802 
Protein GI156743673 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR02015] chlorophyllide reductase subunit Y 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000125918 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGCCAA TCGCGCTGAG CGCTGAGCAG CCGCAGGGAC ACACGTGCAA ACTGCATCCG 
CAGTCGATGT GCCCGGCGTT CGGTTCGTTG CGCATTCTGA GCCGGATCGA GGGGTCGCAC
CCTGTAATGG CCACCGATAC GGGGTGTCTG TACGGCTTGA CCTTTGTGAC CCACTTCTAC
GGTGCGCGCA AGTCGATCCT GGCGCCGACG CTTGGCAGCG CCGAGTTATA CTCCGGCGAG
GTCGTCGAAG GGACGCGCAT GGCGATTGAG GCAGCGGCGC GTGAGCCAGG GTGCCGCCTG
GTGCCGGTCG TGTCGCTCTG TGTGGCTGAA ACGGCAGGCA TGCCGGAGGA ATTGCTGCCG
CGCAAGGTTG GCGATGCCGA GGTGGTGCTG GTGCGCGTGC CCGCCTATGC GATCCACTCA
CATCCCGAAG CCAAGGATGT GGCGCTCGAA GCGTTGCTGC GGCGATTGGG CGATCGCGAA
AGTCCACGCG AGGAACGCAC CGTTGTGGTG CTCGGCGAGG TCTTTCCCGC CGATCAACTG
GCGATTGACG CCATCCTGCG GCGTATGGGG GTCGAGGCGA CGATTGCGTT GCCGGGACGC
TCAATCGATG ACCTGCGGCG CGCCGGTCGC GCAGCGGTGC TGGCGCCGCT CCATCCGTTC
TATAAGGGCG TGACGCGCCT GTATCGTGAA TGGGGCGCAT CGGTGGCTGG CGGTGCGCCG
GTGGGCATTA GCGGCGCCTA TGCCTGGCTC AAGTCGATTG GCGCTTTGCT CGACCTCGAT
CCGGCGCTGG TTGATCGGGT GGCGGAAGAG GAACGCGAGA AGGCGCAGGC GGTGCTGGCG
GCAAAGTCGC TCAAGGGTGC GCGGGTGTTG GTGACCGGCT ACGAAGGCAC CGAACTGGCG
TATGCGCGGT TGCTGGTCGA GGCGGGCGCC GAGGTTCCGT ATGTCTCCAC CTCGATTGGC
GTCGATCCGC TGTCGCTCCC CGATGAACTG TGGTTGAAAG CGCGTGGGAC GCAGGAGGTT
GCCTATCGCA AGGCGCTGGA AGAGGATATG GCGGCGCTCG ACCGGTATGC GCCCGATTTC
GTATTGGGCA CGACGCCGTT TGCGGCTGCG GCGAAAGAGC GCGGCATTCC GGCGATGTAT
TTCACCAATC AACTGGCGTC GCGTCCCTTC TTCCTGAGCG GCGGCATGGC TGCGACGATT
GGGTTCGTGG CAGAAACGTT GGAGCGCGCC AACCGCTATC GTGAGATGCT TGCCTTTTTT
GCGGAGTGA
 
Protein sequence
MEPIALSAEQ PQGHTCKLHP QSMCPAFGSL RILSRIEGSH PVMATDTGCL YGLTFVTHFY 
GARKSILAPT LGSAELYSGE VVEGTRMAIE AAAREPGCRL VPVVSLCVAE TAGMPEELLP
RKVGDAEVVL VRVPAYAIHS HPEAKDVALE ALLRRLGDRE SPREERTVVV LGEVFPADQL
AIDAILRRMG VEATIALPGR SIDDLRRAGR AAVLAPLHPF YKGVTRLYRE WGASVAGGAP
VGISGAYAWL KSIGALLDLD PALVDRVAEE EREKAQAVLA AKSLKGARVL VTGYEGTELA
YARLLVEAGA EVPYVSTSIG VDPLSLPDEL WLKARGTQEV AYRKALEEDM AALDRYAPDF
VLGTTPFAAA AKERGIPAMY FTNQLASRPF FLSGGMAATI GFVAETLERA NRYREMLAFF
AE