Gene Rcas_4042 details


Gene Information

Locus tag: Rcas_4042
Symbol:
ID: 5541553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5243148
End bp: 5244089
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 62%
IMG OID: 640896155
Product: radical SAM domain-containing protein
Protein accession: YP_001434093
Protein GI: 156743964
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR01290] nitrogenase cofactor biosynthesis protein NifB
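
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5243148, 5244089)
print(length, protein_length_aa(length))  # 942 313
```

Under these assumptions the computed values agree with the listed gene length of 942 bp and protein length of 313 aa.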


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00822978
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACGCTT CATCCGCCGG ATCGTGCCAG GGCGTCTGTT TTCCCAAGGT TGATCTGAGC 
ACGCATCCGT GTTATAGCCG GGCAGCGCAT TTCCGCTTCG GACGCATCCA CGTGCCGGTG
GCGCCCCGTT GCAACATCCA GTGCAACTAT TGCATTCGCA AATACGCATG CCCAAACGAA
AATCGTCCCG GCGTCACCAT GCGGGTCATG TCGCCCGACG AGGCGCTCCA CACCGTCCGC
CACGCCATCG CCCACGACCT GCGCCTCCGC GTCCTTGGCG TGGCCGGTCC AGGCGATGCG
CTGGCAAATC AGGCGACGCT CACTACGTTC GAGCGCGCGC GCGCTGAGTT TCCGCACCTG
ATCCGCTGCC TCTCGACCAA CGGCTTGCTG CTGCCGGATC AGATCGACGC CATCGAGCGG
GCCGGGATCA CCACGCTGAC GATCACGATC AATGCGGTCG ATCCGGCAAT TGGCAAGCAG
ATATACGCCC ATGTGCGCTA TCGGGGCAAA ACCTACCGCG GGCGCGAGGC AAGCACACTG
CTCCTGCACA ATCAGTTGAT CGGACTGCGC GAAGCGGCAC TGCGCGGTAT AGTGGTTAAG
GTGAACTCAG TGCTCATTCC CGGTATCAAC GATCATCACC TGATCGACGT CGCGTGCGTC
GTCAAAGATC ACGGCGCCTC CATCATGAAC ATCATCCCGC TCATACCCCT GGCGAAGTTT
GCGCACCTGC CGGAACCCTC ACCCGAACTG CTCAACCGGG TGCGCGACGA GTGCGCGACC
GTCATCGAAC AGTTTCGCCA CTGCCAGCGA TGCCGCGCCG ATGCCATCGG CGTTCCCGGC
GAGGAGGGGT GCGGTACGGG CGAACGAGTC TGTATTCCCA GATTTTTGGC ACAGCGGAAG
GAGCACACCC ATGCAGTTCA AGTGCAATCA GACCCTGCCT GA
 
Protein sequence
MDASSAGSCQ GVCFPKVDLS THPCYSRAAH FRFGRIHVPV APRCNIQCNY CIRKYACPNE 
NRPGVTMRVM SPDEALHTVR HAIAHDLRLR VLGVAGPGDA LANQATLTTF ERARAEFPHL
IRCLSTNGLL LPDQIDAIER AGITTLTITI NAVDPAIGKQ IYAHVRYRGK TYRGREASTL
LLHNQLIGLR EAALRGIVVK VNSVLIPGIN DHHLIDVACV VKDHGASIMN IIPLIPLAKF
AHLPEPSPEL LNRVRDECAT VIEQFRHCQR CRADAIGVPG EEGCGTGERV CIPRFLAQRK
EHTHAVQVQS DPA
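
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a short script. This is a minimal sketch, assuming the gene sequence is given 5' to 3' on the coding strand and that translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs mainly in the start codons it permits; this gene begins with ATG, so that does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the compact NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_text.split()).upper()


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq_text)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq_text)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as shown above.
first_line = "ATGGACGCTT CATCCGCCGG ATCGTGCCAG GGCGTCTGTT TTCCCAAGGT TGATCTGAGC"
print(translate(first_line))  # MDASSAGSCQGVCFPKVDLS
```

Passing the full gene sequence block to `translate` and `gc_percent` allows a comparison against the listed protein sequence and GC content.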