Gene Rcas_3850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3850 
Symbol 
ID5541354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5032273 
End bp5033691 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content64% 
IMG OID640895960 
Producthypothetical protein 
Protein accessionYP_001433905 
Protein GI156743776 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.167663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGAC GACTGCACGC CGGGGTCGCA CGAACGGATA TTACGCCGCC ACCAGGCATT 
GCCCATGCCG GTTGGGGAGC GCAGACCCAT CAGCGGGCCG CCGGAGTCGA TCTGCCGCTC
TGGGCGACGG CGCTGGCACT CTCTGATGGC ATCGAGACGG TTGTGATCGT CGATATCGAT
CTGGTTTACC TCTGGGATGC CGAAGCGCCC GGCGTCATGC GAGCCGTGGA GCAGGCGACC
GGTCTGCCAT CCTCACATAT CCGCCTTGCC TACACCCATA CCCACTCCGG TCCGATCAAC
GGCGCAACCT GGAGTTCCTG GGTGAAGGAG GGCGCCGAGA TGACGCCAGC CTATGATGCG
ATTCTGGAGC ATCATATCGC GGGCGTTGCC CGGCAAGCCT TGCAACGGAT GCGCCCGGTG
CGCATCGCTG CCGGTTCCGG CGAGGCGCGG ATCAATGTCA ATCGGCGCTT TCGGCGCCCT
GAAGATGGCA TGGTGGTTTG TGGGCGCAAT TGGGATGGTC CTGTTGATCA TCAGGTGCAG
GTGGTGCGGC TCGACGATCT GGAGGGGGCG CCACTGGCAG TGATCGTGAA CTATGCCTGC
CACCCGATTA CGGTCGGTCC TGATTGCGAC CTGATTACGC CCGATTATCC AGGGGTGATG
AAGCGCGTCG TCGAACAGTC CACCGGCGCC ACATGTCTGT TTCTTCAGGG AGCAGCCGGC
GATCTCGGAC CGATCCAGGG CGTCGCCCGC GGCGGTCTGG CTGAGTATCG GCGGTTGGGG
AGCATCCTGG GCCACGAAGT GAGCCGGATC TGGTGGGAAC TCGAACCGTG GCGGCGGCGT
GAGCGCTATG CCGGCACGCT GGAGTCCGGC GCGCCACTGG CGATCTACCA AGATGAGCGC
CTGCCCGACC TCGATACCAC GCTCCGGGTT GGCGTGCGCG AGGTGCAATT GCCGCTGAAA
CAGTTCGCCC CGGCTGCCGA GTTGGCGGCA GCGGCGGCGC AGCATATTGA GCGGCTCAAC
CGTCTGCGCG CCGAAGGCGG CGATACTGAG GACATCCGCA CCGAGACGAT GCTGGCGAAG
CGCGCCGGGA TGCGCGCCGA TCTTGCACGC CGCAATGAAG GTCATACCTA TCGCAGCGTG
ACCCTGCAAA CCTTCGCCAT CGGCAACCAG ATCGCTCTGC CAGCCGTACC CGGCGAGCCG
TTCTGCGAGA TCGGCAGGCG GGTGAAGGTC GGCTCGCCTT TCCCATACAC GCTCTTCTCC
GGTTACGCGA ATATCGGCTG GGCGTACATC CCCACTGCCG ACGCTTATCC GCTGGGCGGC
TATGAGATCG AGATTACACC GTTCGCGCCT GAAGCCGCCG ATATCCTGGT TGATGCAAGC
CTGACGTTGT TGCGTGATAT GCTGCCAGAG CGGCGTTGA
 
Protein sequence
MARRLHAGVA RTDITPPPGI AHAGWGAQTH QRAAGVDLPL WATALALSDG IETVVIVDID 
LVYLWDAEAP GVMRAVEQAT GLPSSHIRLA YTHTHSGPIN GATWSSWVKE GAEMTPAYDA
ILEHHIAGVA RQALQRMRPV RIAAGSGEAR INVNRRFRRP EDGMVVCGRN WDGPVDHQVQ
VVRLDDLEGA PLAVIVNYAC HPITVGPDCD LITPDYPGVM KRVVEQSTGA TCLFLQGAAG
DLGPIQGVAR GGLAEYRRLG SILGHEVSRI WWELEPWRRR ERYAGTLESG APLAIYQDER
LPDLDTTLRV GVREVQLPLK QFAPAAELAA AAAQHIERLN RLRAEGGDTE DIRTETMLAK
RAGMRADLAR RNEGHTYRSV TLQTFAIGNQ IALPAVPGEP FCEIGRRVKV GSPFPYTLFS
GYANIGWAYI PTADAYPLGG YEIEITPFAP EAADILVDAS LTLLRDMLPE RR