Gene Rcas_3208 details

Gene Information

Locus tag: Rcas_3208
Symbol:
ID: 5540706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4170442
End bp: 4172511
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 62%
IMG OID: 640895329
Product: hypothetical protein
Protein accession: YP_001433280
Protein GI: 156743151
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
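
The coordinates and lengths in this table are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4170442
end_bp = 4172511

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2070 689
```

Under these assumptions the computed values match the listed gene length (2070 bp) and protein length (689 aa).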


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.905159
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.252675
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAATCGT ATGCTGACCT GGAAATAGTG GTCACTCCGG CAGAGAGCGG TCGCTTTCTC 
CTCAAAGCGC GCGGACCGCG CGGTGAAGAG GGTGATGGTG AACTGCGCCT GCCCGACGCC
GAGCCGCAGA TACAGGCGCT GTTGGCGCGT CTGCGCGCGC TCGATCTCGA CGAAGCGGCG
CTGGTAATGC TTGGTCGGGC ATTATTCGAC GCACTCTTCA CCGGCGCCGT GCGTGATGTG
TATGTCCGCT GCCGAGGCGC GCTCGCCAGT GATGAAGGTC TGCGTCTGCG CCTCAATATT
CCGCCGTCGG CAGCAGCAGT GGCCACACTG CCATGGGAAT TTCTCTATGA CCCGGATCGT
GGTCCTCTGG CGTTGCTCGA TGTGCCGGTT GTGCGTCATC TGCCGCAGCC GAACCGCATC
CCGCCGCTCA CCGCACCGCT GCCGCTGCGC GTGTTGCTGA CCGCTGCGCA AACGCCGCCG
CCGACTGCCG TCGAACGCGA ACTGGCAGCG GTTCAGACGG CGCTCGAGCG CTTTGGCAAC
CAGGTTTCCG CAACAGTCGA GCCGCACCTG ACGGCTGCCA CGTTGCAGAA TCGGTTGCGT
GAAGGGTATC ATATCTGGCA TTTCGTCGGT CATGGCGGGT TTGCAGCCGA CGGCGCCACG
GCGTGTCTCC TGTTCGAGGA CGAACTTGGC GACCCGGAAC CAATCAGCGC GCTGCAACTT
GGTATTATGC TCGACCGGAG CAATCTGCGG CTGGTCGTGC TCGATGCCTG TTCCACCGGG
CAACTGACGC TTGACCCGAT GCGCAGCATG GCGCCGGCCC TGGTGCGCGC CCAGGTACCG
GCTGTCATCG CCATGCAGTT TCAGGCGCCG GAAGAAGCCA CCCGCGCCTT TGCCGGGGCA
TTCTACCGCG CCCTGGCGGA TGCCTTCCCG ATCGATTTTT GCGTTACCGA AGGGCGACGC
GCCGTGATGA ACATTGCCGG GCTTGGTCGC GCCGACTGGG GCATCCCGGT GATCTATACG
CGCACCGAGG ACGGACGCCT CTTCGATCCC CCTGCTGCTC ATACGCCCTC GACCGTGGCG
GAAGTCACGG CGCGTTCGGT CGGAACCGGC ATCCAGGCGC TCGAGAATCT CATCGAGGCT
GGCAGCGATG TGCGCGAGGC GGTGATCGCG TTTCGCGCCG ATTTTCAGGC TGCGGCGCGC
CAGATCGACA TCCTGGCGGA TTACAAAGAC GTGCATGATC AGCTGCACTC GCTCCAGTTC
CACTGCTACA GCCCGATAGT AATCGACATG CGCCGCCTGC CGGATGACGA CCTGGCATGG
GAAAGCATCG CCAATTACGA GGTGACGCTT CAGAGCATCC TGCGCGACCT GGAACAGGTG
GCTGAACGGA ACCGCCTGCC GATGAGTGAA TTGTCGTGGG TGGCGGATGT TCGCATTGCG
CAGGCTGACG TTACGCAAGC AATCGAAACG AACGACCTGA AACTGTTGAG AAAAGCGGTG
CGCCTGCTCA ACCGGGTGTT GACCACCCAA CCATCTCTGA TCAATGCCCG TCTCAACACC
GCAGCCCGCT CCTTGCGCCT GACCTCGCTC GTCGAGGGGA TGTCCGCCGT GCTCAACTGG
CTCCGCACTG CGGGGTACGA CGCAACCAGG ATCAGTCAGA TTGAACTCGG TGTTGCCGGA
TTGACGACCC TCAGCGCCAG CCTTGCGACG CTGGTCGATG AACATGATCG CTGGCAGATC
GTCGATCTCG ATCTCCGCCG GATTGAGCAG CTCATCGATC AGGATATCAC CGAACTGGAA
CTGTCGTGGC CCGATGTGCG CGAACGGGTA GCGCCACTCT ACCTGGAAAG TGCGGAATCT
TGGGGCGCGG CCTTGAAGAA TGATGCTGAT AAGGTCACCG AAGCGCTGGG TGCTGCCGAT
CCTACGAAGG CGCGGCAGTT TTTCCGCCGT TTTCGCCGTC AGGCGGGAGA GCGGTTCTTT
CGCGTCGATA TCGAACTCAA ACGTGTGTGT GATGAATTAC GCAAGGTCGG CGAGCCGCTG
ACTTCTGTTC TCAAGGTGCT GACGCCATGA
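
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below, with hypothetical helper names `clean_sequence` and `gc_content`, shows one way to strip the layout whitespace and compute a GC percentage. It is demonstrated only on the first displayed line, so its result is not expected to equal the whole-gene GC content listed in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used as a small demo input.
first_line = "ATGAAATCGT ATGCTGACCT GGAAATAGTG GTCACTCCGG CAGAGAGCGG TCGCTTTCTC"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], f"{gc_content(seq):.1f}%")
```

Passing the full sequence block to `clean_sequence` in the same way would give the input needed to check the whole-gene figure.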
 
Protein sequence
MKSYADLEIV VTPAESGRFL LKARGPRGEE GDGELRLPDA EPQIQALLAR LRALDLDEAA 
LVMLGRALFD ALFTGAVRDV YVRCRGALAS DEGLRLRLNI PPSAAAVATL PWEFLYDPDR
GPLALLDVPV VRHLPQPNRI PPLTAPLPLR VLLTAAQTPP PTAVERELAA VQTALERFGN
QVSATVEPHL TAATLQNRLR EGYHIWHFVG HGGFAADGAT ACLLFEDELG DPEPISALQL
GIMLDRSNLR LVVLDACSTG QLTLDPMRSM APALVRAQVP AVIAMQFQAP EEATRAFAGA
FYRALADAFP IDFCVTEGRR AVMNIAGLGR ADWGIPVIYT RTEDGRLFDP PAAHTPSTVA
EVTARSVGTG IQALENLIEA GSDVREAVIA FRADFQAAAR QIDILADYKD VHDQLHSLQF
HCYSPIVIDM RRLPDDDLAW ESIANYEVTL QSILRDLEQV AERNRLPMSE LSWVADVRIA
QADVTQAIET NDLKLLRKAV RLLNRVLTTQ PSLINARLNT AARSLRLTSL VEGMSAVLNW
LRTAGYDATR ISQIELGVAG LTTLSASLAT LVDEHDRWQI VDLDLRRIEQ LIDQDITELE
LSWPDVRERV APLYLESAES WGAALKNDAD KVTEALGAAD PTKARQFFRR FRRQAGERFF
RVDIELKRVC DELRKVGEPL TSVLKVLTP
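
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below builds a codon table by hand rather than relying on a library. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons or ambiguous bases. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order.
# Assumption: table 11 uses the standard amino-acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence shown above.
head = translate("ATGAAATCGT ATGCTGACCT GGAAATAGTG")
tail = translate("ACTTCTGTTC TCAAGGTGCT GACGCCATGA")
print(head, tail)  # prints: MKSYADLEIV TSVLKVLTP
```

Both fragments agree with the start (MKSYADLEIV) and end (TSVLKVLTP) of the protein sequence listed above.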