Gene Rcas_4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4165 
Symbol 
ID5541676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5388520 
End bp5389707 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content58% 
IMG OID640896276 
Producthypothetical protein 
Protein accessionYP_001434214 
Protein GI156744085 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0171049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGTTCAC AACTCTTGCT TGTCACCACA CTGCTTGCCG GTGCGATCTC TGTCGCCGGG 
TGCGGCAATC TGCCCTTCCT CTCGCAGGGT AACCCGCAAC CAACCGCCTC GCCAGCTGCT
TCCGGCGATG CCGCAACGCC TGCGACACAA CCAACTGCCG CCGCTGAACA ACCGACACAG
GCGCCGGCGA ATACTCCGGC ACAGACGATA CCCACCGCAC CACAGACCGG CGGCGCTCCT
TCGACACCCG CCACGGGCGA TGGCGCTCCG ACGATTCCGC TGCCGCAAAG CCAGGCGAAC
CTGGCGCAAC TCGAAAGTTA CCGCATAACG ATTGTATCGA AGATGAACGG CAAAACCACA
GACGGGAAAA CCATCGAATC ACAGATGACC TATACGCAGG CGCTGCATCG ACCCAGTAAG
ACCGGGTATA CGCTGGTTGT CGAAAACAGG CAGGGTGCAC CATCCACCCA ATCAGAACTC
TACAGTGTTG GTGATATGTT GTATATCTAC CAGCGCCAGG GTGGGAAAGA ACTCTGTCAA
CCGGGCATGA TGGCTGGCAT GGGCGACATG CTACGCGGCA TTGCCGACGC AATGACCGCT
CCCCTTCAGA CCGGCACGGC ACAACTCGTC AATCGCGGCG AGACGGTCAA CGGCGTGCTG
ACCGATCGCT ACACGCTGGA TCAGGAGACG ATCAATCAGT TCGGCGCGAC GGTGGAAAAA
GCCGATCTCT GGGTGGCGCG CGATGGCGGC TATCTGGTCA AATATGACCT GACGATCAAC
GTAACGTCCA ACACCACCGG TTGGGCAGCG ATGCTCGGCG GCGGCGCACC GATTTCGGAA
GGCACGATTG TCTACAACTG GTCGCTTGAA GACATCAACA AAACGACCAT TACCCTTCCA
TCCGCGTGCA GCGAACAGAC GGTCGGCGTC GATCTGCCGC TACCAGCCGG CACACAGGTC
GATATGGCGA TGCCGAACAC AACGATGGGC AAGGTAAATG CCGGCATTGA CTCGGTTTTG
ACCTTCTTCA AGACGGAGTA TCCCAAACTC GGATACGAAC TGACAGACGA GTATGGCGAT
GCTCAGAATG GCTACATCCT CAACTTCAAG AAAGGCGCGG ATGAGGTGAT GGTTCAGTTC
TCAACCATGT CGGATGGGTC GGTGCAGATC ACACTCACGC GCGGCTGA
 
Protein sequence
MRSQLLLVTT LLAGAISVAG CGNLPFLSQG NPQPTASPAA SGDAATPATQ PTAAAEQPTQ 
APANTPAQTI PTAPQTGGAP STPATGDGAP TIPLPQSQAN LAQLESYRIT IVSKMNGKTT
DGKTIESQMT YTQALHRPSK TGYTLVVENR QGAPSTQSEL YSVGDMLYIY QRQGGKELCQ
PGMMAGMGDM LRGIADAMTA PLQTGTAQLV NRGETVNGVL TDRYTLDQET INQFGATVEK
ADLWVARDGG YLVKYDLTIN VTSNTTGWAA MLGGGAPISE GTIVYNWSLE DINKTTITLP
SACSEQTVGV DLPLPAGTQV DMAMPNTTMG KVNAGIDSVL TFFKTEYPKL GYELTDEYGD
AQNGYILNFK KGADEVMVQF STMSDGSVQI TLTRG