Gene Rcas_3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3641 
Symbol 
ID5541143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4762246 
End bp4764522 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content49% 
IMG OID640895761 
Producthypothetical protein 
Protein accessionYP_001433708 
Protein GI156743579 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.022176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000125386 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGGATCG AGCCGCGACG CGTGCTTGAT GTGGGGGTTG GATTTGGCAG ATGGGGCATC 
ATTGTGCGTG AGTTTTGTGA TGTCTGGTTC GGTAGAGTAT TGCCGGAACA ATGGTCGGTA
TATGTCGAGG GGATTGAGGC TTTTGCGCCG AGTATTTCGA GTTATCACTC GTCTTTCTAT
AATAAAATCT GGGTGGGAGA TGCTGTCGAA GTTTTTTCTA GGATAGACAA AAGCTGGGAT
GTCGTGATAT TTGGTGATGT TCTGGAGCAC TTCACTCGAG AAAATGCAGA GAAGTTATTG
TTATGGTCAG TTAATCACTC AAACTATGTC ATTGTTAATA TTCCCATCGG TGATGATTGG
GATCAAGGAG AAATGTATGG AAATCCGTAT GAACAACATC GCAGCGTATG GGTTGAGGAG
GATTTTCATG GCTTCTTCAT GGTAAGAAGG GCGCTATTCA ATGACTATCT CGGGAGGAAG
CACGGATCTT TTGTGCTATC GCGTCATGAT CCAAGAGGTG TCGCGTTAAG GCTGTTCTCG
GAACGAACCG ATGAGTTTCA GAGTGAGATC GGAGTTTCTA TGGATCGGTT AACTCATCAT
GTAGATTTGG ATTCCTTACA TAACATTGTC GAACGAACGC GTGCTATCAG CGAAGAACTC
GAATCTATCA AGAACTCACG GAGTTACCAG GCAATGGTGC AGTTTCGTCG ATCGCCTATC
GGCCATATTG CTGCGAAAAC GTTGCGATAT GTCGATCAGA GTGTTGGGAA AATAAAAGGC
CTTTCAAAGA TCAGGCAAGG AATTGTCGGA TTGGTTTATC AGTATGTGCC CAGGGTGAAC
ACTCGTCTCC TGGAGCAGAT GTATCATTTA CCTCACGGTA TCGTTCGTTT GAGGTTGATT
GGCAAAAATC CGCACAGCCA GGGAGCGGAA GCGTGGATTC TGGCGGTGCG GAACAATCAT
GGCTTTCTCA GCGCCGGTCA GATCAGATTG TACGGTTCCT GGACGGTGCG TGAGGGGAGT
GGCATGAAGG GTATGCCTGC ACTCGTTGCT TCAGAGTATG GTTGGGCTGA TATTCCGGCG
GAGCATGGTG CTCATCTGGT CTTGCTGCGG CACCCATGGA GTGGTATTGT CGAGTTACAG
TTTGGGAATG TGCGACGTCG GTTTGACCTC TATGCTTCGC AGGCACACGA TATTATTATT
GATCTTACGA CGCTAGAGCA GAAGCAATCG CTTCCTACCG TGCGTCCCGT TCCGTTGCCG
GATACTTTCA GACGTTGGCT CGAGACGACA GATTTTAGCC AGGGAGTTGT TACGGTTGTC
AATCCAGAGT GGCGAGGAAT TTATAGTTCC ACTAAGAATC TCTTTGATAC CATTCTGGAA
CTGCCTGATC ATCTTGACGA AGCATCGGGT TTGCAATATG CCTGGTTGCT TGCAGAAACG
AGATGTCAGG TCGTAGTGAT TCAGGGTTTT CCGATGACGT ACTATCATTT AGTGACAGCA
CTGCGTCGGA TTGCTCCGAA TATGCGTATT GCTGTTATCT GGCACGGGAC ATTCTTGCAA
TTGACGGAAG ATTACGTCTG GCAATCCTTC CGGCAGGTTG AACGATTTTG TCGCGAGGGT
GCTATCTGGA AATTGGGGTT TGTTAAGGCG CGCATGGCTG AGATTATGGC GAAACGTGGT
ATACGAACCG GATTTGTACA AAACATGGTG CGCGAGGTGC CAGATAAGCC TTCGGAGCCG
TTACCCGGCG GTCCACACAT TGGTGTCTGG TTGCTTTACG ATGGATGGCG CAAGAATCCT
TTCGCTTCTA TTGCTGCAGT GTCTGCGCTT TCTGGTGCGA CTCTTCACAT GTCGGCTGCC
TCCGACCGTG TGCAGGAGTT TGCTGAGTTT ATGAACATAC GCAGCAATAT ACACTTTCGG
CCCATCGACC AGGGTTTGAT GCGACATTAT CTGGCGCAGA TGCATCTGAA TCTTTATGTG
ACTTTGAGCG AATGCGCGCC GATGCTTCCG CTCGAAAGTC TATCAGTAGG TTCTCCGTGT
CTGTTCGGTC CTACGACGTA TTACTTCGAC GATCATGACT ATTTGCGCGA ACGTCTCGTG
GTGCCGCAAC CAGATGATGC AGGGATGATT GCTGCGTACA TGCAGCGCGC TCTGGAGGAG
CGAGGTGAGA TCATCGCCGC CTACGCCAGG TATGCGCCAG ATTATAATCG CCGCGCGAGA
GCGACGCTCG CTGAGTTTTT GGAATTCGAT GGTTGTGATA GTGCGAAAAA TACATGA
 
Protein sequence
MRIEPRRVLD VGVGFGRWGI IVREFCDVWF GRVLPEQWSV YVEGIEAFAP SISSYHSSFY 
NKIWVGDAVE VFSRIDKSWD VVIFGDVLEH FTRENAEKLL LWSVNHSNYV IVNIPIGDDW
DQGEMYGNPY EQHRSVWVEE DFHGFFMVRR ALFNDYLGRK HGSFVLSRHD PRGVALRLFS
ERTDEFQSEI GVSMDRLTHH VDLDSLHNIV ERTRAISEEL ESIKNSRSYQ AMVQFRRSPI
GHIAAKTLRY VDQSVGKIKG LSKIRQGIVG LVYQYVPRVN TRLLEQMYHL PHGIVRLRLI
GKNPHSQGAE AWILAVRNNH GFLSAGQIRL YGSWTVREGS GMKGMPALVA SEYGWADIPA
EHGAHLVLLR HPWSGIVELQ FGNVRRRFDL YASQAHDIII DLTTLEQKQS LPTVRPVPLP
DTFRRWLETT DFSQGVVTVV NPEWRGIYSS TKNLFDTILE LPDHLDEASG LQYAWLLAET
RCQVVVIQGF PMTYYHLVTA LRRIAPNMRI AVIWHGTFLQ LTEDYVWQSF RQVERFCREG
AIWKLGFVKA RMAEIMAKRG IRTGFVQNMV REVPDKPSEP LPGGPHIGVW LLYDGWRKNP
FASIAAVSAL SGATLHMSAA SDRVQEFAEF MNIRSNIHFR PIDQGLMRHY LAQMHLNLYV
TLSECAPMLP LESLSVGSPC LFGPTTYYFD DHDYLRERLV VPQPDDAGMI AAYMQRALEE
RGEIIAAYAR YAPDYNRRAR ATLAEFLEFD GCDSAKNT