Gene Rcas_3849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3849 
Symbol 
ID5541353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5031008 
End bp5032270 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content62% 
IMG OID640895959 
Producthypothetical protein 
Protein accessionYP_001433904 
Protein GI156743775 
COG category[S] Function unknown 
COG ID[COG2833] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.565023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.161069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGACA TCTACGCCGG TTACCGGCTC GATAGGGGGT TCCATGTGGC TCAGACGGCC 
CGCCTGGTAG CACGTTATCA CTATATCGAG GCGCAATTGA TGCGTATTGC GGCAGGACGA
ATGGCCGAAC TGCCTGAGTG GGAATTGAAA TGTCTGCTGG GCCGGCATCT CTGGCAGGAT
TCGCAGCACG CCGATGCATG GCTGTCCCGT CTGATCGATC TGCGCTGGCC ACGCCGCGCG
CCGCTCAACC CTGGCGAGGC GACCTGCCGG ATGTTGCGGT TGCTCGATGA TGCACCCGAC
AGCGCTGCAT TCGTTGCTGC GGTCTATCGC CAGGTGAAGC CGCGCCTTGC GGCAGCGTAT
GCCGCTCATC GCCAGGCGTG TGCATCACTG GCTGATGAGC CGACATGGGA TCTGCTTGCG
CGAATCCATG CCGACGAAGA AGAACAGATA AGCCAGGGCA TGGCGCTGCT CGAGTCGTTC
TCGTCTGCGG CGCGTGCCGT TGCGCAGAAC TACGAATCCG CTGTGGCTGA GGCATGCGAT
GGAATCGGCA GTTTCGCCGA TCCGGCAACT GCGGCTGAGC GCGAAGCCAG CGGTTCCTTC
GAGGACCAAA CGGTGCGACC GGCGCCGCCG ATCGCCGCCC GCGATGCGCG CTTCCGCTTC
GGCGAGCGGG CCGCCGATCA TCCACCCGCC GATGAACACG AAATGGCGCT GATGATGGCG
CACCGCGATG CCGACAATGA GATGCACGCC GCCGAACTGC TCGGTCGCAA CCTCTATGAG
CATCCAGAGA TGCCCTGGGA ATATCACGTT GATATGGCGC GCCAACTTTG GGATGAAGTG
CGCCACGCCG TGCTCTATCA GCGGTACCTG GAGCATCTGG GCGGCAAACT GGGCGATTTT
CCGGTCATCC CCGGCAACTA CGCCTATCGG ATGAGCCTCG ACTTTCCGCA CCGGCTCTAC
GATCTGCATC TACGCGGCGA AAAACTGGGC ATGCCCGATC TGATTCGCTC CCGCGAAACC
GCCCGCGCAC GCGGCGATAC CCCGTATGCT CTGCTCAACG ATTTTGTCCA CGCCGATGAA
GTGCCGCACG TGAAGAACGG GCGCTGGCTA CGCTGGCTGC TCAACAATGA TGAAGCGGCT
TTCCGTCGGA TCGAGCGCGA GACGATGCGA CTGCGCGCAG CATATGAACA GGCTCACGCC
GATGACCCGA TTGTCATGGC GTATACCGGA CTTGTGCCAG CGATCTTGCA ACAAGAGGAA
TAA
 
Protein sequence
MYDIYAGYRL DRGFHVAQTA RLVARYHYIE AQLMRIAAGR MAELPEWELK CLLGRHLWQD 
SQHADAWLSR LIDLRWPRRA PLNPGEATCR MLRLLDDAPD SAAFVAAVYR QVKPRLAAAY
AAHRQACASL ADEPTWDLLA RIHADEEEQI SQGMALLESF SSAARAVAQN YESAVAEACD
GIGSFADPAT AAEREASGSF EDQTVRPAPP IAARDARFRF GERAADHPPA DEHEMALMMA
HRDADNEMHA AELLGRNLYE HPEMPWEYHV DMARQLWDEV RHAVLYQRYL EHLGGKLGDF
PVIPGNYAYR MSLDFPHRLY DLHLRGEKLG MPDLIRSRET ARARGDTPYA LLNDFVHADE
VPHVKNGRWL RWLLNNDEAA FRRIERETMR LRAAYEQAHA DDPIVMAYTG LVPAILQQEE