Gene Rcas_3437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3437 
Symbol 
ID5540936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4484968 
End bp4486968 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content61% 
IMG OID640895555 
Producthypothetical protein 
Protein accessionYP_001433505 
Protein GI156743376 
COG category[S] Function unknown 
COG ID[COG4412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.54517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0867765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGTT TCCCCGCCCG ACCGTTGCCG ACCCTGCCCG GCGCTCTTGC ACTGCTCGTG 
ATTGCACTCC TGTCTGCCTG CGCCGGCGCC GTGCCGTCAT TGCCCCCTTC CCATCTTCCT
GTGATTGCGT CGCCGACGGT TGCGCCGTCT GTCACTCCGT CATCCGTGCC TGTCGCTTCT
ATCCGCCCCA CCGCATCTAT CGTGCCCGTT ACCGCTCCCA TCGACGAACT GACGACGATT
GCCGCTGCCG TCCCTGCTCC TCGTGATCAG CGCGCGATCA GTGCGGCGTT CCATGGCGGC
GACATCCCCT ATGTGGCCCG GACGATGCCG CTCGACGTTC GGATTGGCGC AACCGAGACC
TTCTGGGTGG CTGATGTCTC GAACAATGTG AACTATACTG TCACAGCACA ACTGCGTTAC
GCCGGTCCGG TTGTGTTGAT GTATATTGAT ACGACGCTCG ATGTCCCGCA ACATCTGATC
GAGCAGTCGG CGCAGGTCTT CGAGGAACGG ATCTACCCGC GTAATCGCTT GTTGTTCGGT
GAGGAACGCA TCCCCGGCGT CGATGGCGAC GCGCGACTGA CGATTCTCAA TACCCGCATT
CGCGGAGCAG GCGGGTATTT TTCGTCAGCC GATGGCGTGA CGCGCGCGGT CAATCGTTTC
AGCAACGAGC GTGAGATGTT CGTCATCGAC GCAGTCGCCT TCCCTCCCGG CAGCGAGACC
TACAACGCAA CGCTGGCGCA TGAGTTTCAG CATATGATCC ACTGGCACCG TCAGCCACGC
AGCCCAACAT GGTTTAACGA AGGTCTCTCG ATGCTCGCCG AGGACCTGAA TGGATTGGGC
GACAATGGCG CGGCATTGGC GTATCTCCGC AATCCCGACA CGCAACTGAC GACGTGGGCG
CCGGGAAGCG GCGTTACGCG CCACTACGGT GCAGCGCAAC TCTTCATGCG CTATCTGTAT
GAACAGTATG CTGGCGACAG TCGCCCCGCC GACTGGATCG ACGCTGATGC GGGCAACAAT
GTGCATGTTC TGGCAAATCT CGCCGCTTAC CGTCGCCCCG ATATTGTCAC CTTCGCGGAT
CTGTTTGCCG ATTGGGCAGT CGCCAATGCC TTGAATGATC CATATGTGGA CGATGGACGC
TATGCGTATC GTGGCATTCC GACGCGCGCC GCAACGATGC GCCTTGAACC GGGAACAACC
TCCGCTACAG TGCGTCAGTT TGGAGTGGAT TACATGGGTC CGCTCGACGG TCCGCTGGCA
ATCGATTTCG ATGGCGCCGA TACGGTGCAG TTGGTTGGAG TGTTGCCGGC CGAAGGGCGC
TTCGCCTGGT GGAGCAATCG CGGCGATGAA AGCGTCTCGA CACTGACGCG ACACCTCGAT
CTGCGCAGTG TGTCGCGCGC AACGCTCACG TTTCGTCTCT GGCACGAACT TGAGCGCGAC
TACGACTATG CCTTCGTCAC CGTCTCCAAC GACGGCGGTA CGCACTGGCA GACGCTCCCC
GGCATCACCA CTCGTGCCGA CGATCCGCAG GGGCACAACA TGGGGTACGG ATTCACCGGC
GTCAGCGGCG CGCCGGATGT CGCCCTCGGC GGCGTGCGCG GACGCTGGAT CGACGAGCGC
ATCGACCTGA CGCCGTTCGT CGGTCAGGAC GTTCTGCTGC GCTTCTGGGT CATCTCCGAT
GCGGCGATCA ACGGTCCTGG CATGCTGATC GATGATATTC GAGTTCCGGA GATTGGCTTC
GCCGATGGCG CCGAAACCGA TGACGGCGGA TGGGACGCGA TAGGGTTCGT GCGCACATCC
GGCATTCTTC CGCAACGCTG GGTCGTGCGG TTGCTGTTGT TCGACAGCGA TGAAACGCGG
GTGATCATTC CAGAGATCGA TAATCAGGGG CGCATCAGCC TCCGGGTCGC TGCCGGGCAG
CGCGCAATAC TGCTGGTTAG CGGCGCGACT CATTTCACGA CTGAACCGGC TTCGTACCGA
GTCAATCTGT ATCAACCGTG A
 
Protein sequence
MNRFPARPLP TLPGALALLV IALLSACAGA VPSLPPSHLP VIASPTVAPS VTPSSVPVAS 
IRPTASIVPV TAPIDELTTI AAAVPAPRDQ RAISAAFHGG DIPYVARTMP LDVRIGATET
FWVADVSNNV NYTVTAQLRY AGPVVLMYID TTLDVPQHLI EQSAQVFEER IYPRNRLLFG
EERIPGVDGD ARLTILNTRI RGAGGYFSSA DGVTRAVNRF SNEREMFVID AVAFPPGSET
YNATLAHEFQ HMIHWHRQPR SPTWFNEGLS MLAEDLNGLG DNGAALAYLR NPDTQLTTWA
PGSGVTRHYG AAQLFMRYLY EQYAGDSRPA DWIDADAGNN VHVLANLAAY RRPDIVTFAD
LFADWAVANA LNDPYVDDGR YAYRGIPTRA ATMRLEPGTT SATVRQFGVD YMGPLDGPLA
IDFDGADTVQ LVGVLPAEGR FAWWSNRGDE SVSTLTRHLD LRSVSRATLT FRLWHELERD
YDYAFVTVSN DGGTHWQTLP GITTRADDPQ GHNMGYGFTG VSGAPDVALG GVRGRWIDER
IDLTPFVGQD VLLRFWVISD AAINGPGMLI DDIRVPEIGF ADGAETDDGG WDAIGFVRTS
GILPQRWVVR LLLFDSDETR VIIPEIDNQG RISLRVAAGQ RAILLVSGAT HFTTEPASYR
VNLYQP