Gene Rcas_3746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3746 
Symbol 
ID5541248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4914721 
End bp4915899 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content59% 
IMG OID640895857 
Productchlorophyllide reductase iron protein subunit X 
Protein accessionYP_001433804 
Protein GI156743675 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1348] Nitrogenase subunit NifH (ATPase) 
TIGRFAM ID[TIGR02016] chlorophyllide reductase iron protein subunit X 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0605815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000012212 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCCAC GCATGATCGC AATTTACGGC AAAGGCGGGA TGGGGAAGAG TTTCTTCACC 
TCCAATCTGA CCGCGCGACT GGCATACGAT GGCTATCGGG TGTTACAACT CGGCTGCGAC
CCGAAGCACG ACTCGTGCAA CACGATCTTC GGTGGGCATT CGTTGCCGAC ATTAGGCGAT
CAATGGCGCC TGTTCAGAGA GGCGGGTAAA GAGGATCAGT TGTCGATTGG CGACGTGATC
TTCCGCAATG AATTGCGCCC CGGCGTCGTC ATCTTTGGCT GTGAACTCGG CGGTCCCGAG
GTCGGGCGTG GTTGCGGCGG GCAAGGAATC TCAACCGGCT TCAAGGTGCT CGAAAACCTG
GGAATGAGCC GCTGGAACCT CGACTTCATC GTAATGGACT TTCTTGGTGA TGTGGTGTGC
GGCGGGTTCG CTACACCACT CGCTCGTTCG CTTGCCGAGC AGGTGATCAT TCTCGTCGGA
CACGACCGCC AATCGCTCTA CGCAGCGAAC AATATTGCGC GCGCGGCGCA GTATTTCCGC
TCGATGGGCG GCACGACCCA GATCCTGGGG TTGGTGGTGA ACCGTGATGA TGGCAGCGAT
ACCGCCGATC AGTATGCTGC GGCAGTCGGG TTGCCGATCC TGACGCGCGT GCCGTTGAGC
CGCAAGGTGC GTGAACTTGC CGATGCCTGT CGCCTGGCGC TGGAGGACGA ACAGTTCAAT
CAAATCTTCG GCGATCTGGC GAAGCGTATT GCGCGCGGAG AATTGCAACC GGTCGATACG
TATACGCCGC TGAGTTATGA TGAGTTTCTG CGCGTGTTTG GCGCCGAAGA GCCGCCAGGA
AGACCGGACT CGGCGCGCGC CGAGGACCTG TTCGGCGAGA AGACGCCGGT CTTTACCGTG
CCTATCCTGT CGCTCAAGCC GGTCATTCCA CAGGTGCAGG TGACCGATCC GGTGCAACTC
AAAGTGCAGC AGATGATCGA AGCAATCGGT ATGTATGTCA CCGATATGGT GCGTAGTGAG
CGCGATGGCA TCACCGTCAC CTCTGGTTCG GTCGAAATTC GATTGGGAGA GCCGCAGGAC
CTTGAACATA AAGTGGCGTT CCTTTCAGCG CTGCGGCGGT CAGGACAGGC GTTCAGTTTC
GTCGATCTGC GCTACGCCGA CGCGCCGACG TATCGGTGA
 
Protein sequence
MAPRMIAIYG KGGMGKSFFT SNLTARLAYD GYRVLQLGCD PKHDSCNTIF GGHSLPTLGD 
QWRLFREAGK EDQLSIGDVI FRNELRPGVV IFGCELGGPE VGRGCGGQGI STGFKVLENL
GMSRWNLDFI VMDFLGDVVC GGFATPLARS LAEQVIILVG HDRQSLYAAN NIARAAQYFR
SMGGTTQILG LVVNRDDGSD TADQYAAAVG LPILTRVPLS RKVRELADAC RLALEDEQFN
QIFGDLAKRI ARGELQPVDT YTPLSYDEFL RVFGAEEPPG RPDSARAEDL FGEKTPVFTV
PILSLKPVIP QVQVTDPVQL KVQQMIEAIG MYVTDMVRSE RDGITVTSGS VEIRLGEPQD
LEHKVAFLSA LRRSGQAFSF VDLRYADAPT YR