Gene Rcas_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1838 
Symbol 
ID5539316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2347417 
End bp2348541 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content63% 
IMG OID640893976 
Producthypothetical protein 
Protein accessionYP_001431947 
Protein GI156741818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGCGCC GCCTGGCGCT CGTGCAGCGC GGGTGGCGGC ACGAGCGCAG TGCGGAAGCC 
GTCGAGTCCC TGGCGCTGAT TGCCATCGTT CTGGTGTTGC TGGCGGTTAT CAGTCTGGTG
TTCCGCGACC GTGCAGCCGC CATTGGCGAT GCAGCGACGG CGACGCTCGC ACGCTGGCTG
GCTGGCGTGC CGGGTACAGT GCCAGTTGGT AACACTAGCG TCATTGCGGC GCCGCAGGTA
ACGGCGCTCC CCTATGCTGT GGTGAGCCTG CCGCAACTGC TGAGGCAGGC GGCGGATGCT
GCGCCGTGGG TGGAGCGCGC GATCGGGATA GCGGGCAGTC TGCTGGCAGG GATCGTCGCA
TGGCTTACGG TAACGCATCA AGCGCCGCAG CGCACGTCCA ATTCGTGGTT CAGCCAGGCT
GTCGACGCGC TACGCTGGGC GGGTGAGCAG GCCGCTGGGG TATTCGTTGG CCTGTCCGAG
GGGGTGTATG ACACAGTTGC CGGCCTGGTC ACCCTCGGAG TCGATCTCGT CAAAATGATC
GCAGGCGATG CTGGCACGCG GCAGAAATAT GGAGCGCTGA TCGAGGCGCT GATCACCGAT
CCGCTGGGGA CGGCAGGAAA CGTACTCTGG TCGATCGTCG AGCCGATTGT GACTGACTGG
CAGGAAGGCC GGTACGGAGA AGCGGTTGGA CGGACGGTGT TCGAAATATT GCCTGCGATT
CTGGCGATCT TTACCGGGGG TGCAACTGCT GCCGGGTACG CGACGAAGGC GGGCACGGCG
GGTAGAGCGG TGGATGTGCT GGGGGACGCT GGACGGGTAA TGAACCGTAT CGACAACGCA
GCCAGGGTAG CAAACCGGAT TGACGATGCG GGCAGAGTAG CAAACCGGAT TGACGATGCA
GGTAGAATAG CAAATAGGAT CGATGATGCG AGCGACGTGG CGAAGGGAAA TGCTGTCAGC
GGCGCCGCTA CCCGGCTAAG CCCGTCACAA CTCCGTCAGC TCAGGCGGCA GGCTGGCACG
GTCGCACGAA CAGGCGTGCC ATATGATGAT CTCGGTTTTC CGATATTCGA TTCATTCTTC
GACTCTCACG GACAACGGAG AAGTTATTGC CACGCTCAAC GGTAG
 
Protein sequence
MERRLALVQR GWRHERSAEA VESLALIAIV LVLLAVISLV FRDRAAAIGD AATATLARWL 
AGVPGTVPVG NTSVIAAPQV TALPYAVVSL PQLLRQAADA APWVERAIGI AGSLLAGIVA
WLTVTHQAPQ RTSNSWFSQA VDALRWAGEQ AAGVFVGLSE GVYDTVAGLV TLGVDLVKMI
AGDAGTRQKY GALIEALITD PLGTAGNVLW SIVEPIVTDW QEGRYGEAVG RTVFEILPAI
LAIFTGGATA AGYATKAGTA GRAVDVLGDA GRVMNRIDNA ARVANRIDDA GRVANRIDDA
GRIANRIDDA SDVAKGNAVS GAATRLSPSQ LRQLRRQAGT VARTGVPYDD LGFPIFDSFF
DSHGQRRSYC HAQR