Gene Rcas_3798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3798 
Symbol 
ID5541300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4969544 
End bp4970647 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content63% 
IMG OID640895908 
ProductNHL repeat-containing protein 
Protein accessionYP_001433855 
Protein GI156743726 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.88272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGCC CACTCGTCTG GATCGGCGCG CGTTCGCCTG GTGCATTGAC GCTTCCCCCT 
GCCGACCCGA CGCCGTCGCA GTTGTACGCA CCACGTGGCG TCTATCTCGA TGATCAGACG
CTGATTGTCG CGGATTCCGG CAACCATCGC GTGCTCATCT GGCATCGCAT CCCTGATCGG
GACGGACAGC CGGCTGATGT GGTGCTGGGA CAACCCGATT TCTACAGCGA GGGACCGCAG
GCTGCCGGGC GCGGCTCGCG GCACGGGATG CACCTGCCGA CCGGTGTGAT GGTGATCGAT
GGACGGTTGT GCGTCGCCGA TTCGTGGAAC CACCGCATTC TGGTGTGGAA TCGCGTTCCT
GAGACCTCGA ACGCTCCGCC GGATAGGGTC ATCGGGCAGG CGGACCTGGA CGAATGCGAG
CCGAATCGTG GCGGCGGCGT CACAGGGTGT GGATTCTATT GGCCCTACGG AATCGGATGG
GTTGCCGGTC GTTTTTACGT CGCCGATACC GGCAACCGCC GCGTCCTCAG TTGGAACAGT
ATTCCTGAAG ACAGACAGCC GCCCGACCTG GTGCTGGGTC AGAACGACGA GTGCAGCCAT
GCCGAAAATC GTGGCGAAGG GCCGTCGCCA TGTTCGTTTC GCTGGCCCCA CGCGATTGCC
GGCAATGGCA CAACCCTGTA TGTCGCCGAT GCCGGCAACC ACCGTGTGCT CGGCTGGACG
CCAATCCCCG CGCGCGATAC ACCGGCGTGC CTGGCGCTCG GTCAACGCGA CTTTCAGAGC
GCGTGGGAGA TGCCCCACAC GCCACCGGGA CCTTCCGCAC TCCGCTTCCC GTATGCGGTC
GCGTGCGCAT CCGGCAGGCT GATCGTCGCT GATACTGCCA ATAATCGGGT GCTGATGTGG
CACACGTTGC CGCGTGCAGG CGTGTTTCTG CCCGCCGATA TGGTCATCGG GCAGCCGGAT
TTTGCCGGCA ACGGCGAAAA TCGCTGGCAG GCGGTCGAGC GCGATACCCT CTGCTGGCCC
TATGGTATAT CCTGTCACAA CCATCGTTTG GCGATTGCCG ATTCGGGCAA TAATCGTGTT
ATCATATGGG ATATCAGCGT CTGA
 
Protein sequence
MVRPLVWIGA RSPGALTLPP ADPTPSQLYA PRGVYLDDQT LIVADSGNHR VLIWHRIPDR 
DGQPADVVLG QPDFYSEGPQ AAGRGSRHGM HLPTGVMVID GRLCVADSWN HRILVWNRVP
ETSNAPPDRV IGQADLDECE PNRGGGVTGC GFYWPYGIGW VAGRFYVADT GNRRVLSWNS
IPEDRQPPDL VLGQNDECSH AENRGEGPSP CSFRWPHAIA GNGTTLYVAD AGNHRVLGWT
PIPARDTPAC LALGQRDFQS AWEMPHTPPG PSALRFPYAV ACASGRLIVA DTANNRVLMW
HTLPRAGVFL PADMVIGQPD FAGNGENRWQ AVERDTLCWP YGISCHNHRL AIADSGNNRV
IIWDISV