Gene Rcas_1689 details


Gene Information

Locus tag: Rcas_1689
Symbol:
ID: 5539165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2175078
End bp: 2178272
Gene Length: 3195 bp
Protein Length: 1064 aa
Translation table: 11
GC content: 64%
IMG OID: 640893826
Product: hypothetical protein
Protein accession: YP_001431799
Protein GI: 156741670
COG category: [R] General function prediction only
COG ID: [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation
TIGRFAM ID:
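
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2175078
END_BP = 2178272
GENE_LENGTH_BP = 3195
PROTEIN_LENGTH_AA = 1064


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # compare with Gene Length
print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.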


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0104584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000757543
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACGATTG TCATTCTCAT ATTCATTGTT CTGACGCTCA TCTTGATCGG CGCGATCATC 
GCTCACGCGC TGCCTGTCGC ATGCCGCGCC GATGATCCGC TCGAGCGATA TTTTGAGTAT
GCGCTGATCG GTGCGTTGCT CAACGGATGG CTGGCGTTCA CCCTGGCGCA GATCGGCGCC
TTCTCCGCGC TGCTGCACGC CGCTATCATT GCCGTTCTTT GCCTTATCGC TCTGCGGATC
GGTCGCCGCC ACGCGCCCAA CACTCCGACC ACGCCAGCAG ATGTGTGGAG GCGGTGCATT
GCCTGGGTCA GGCAGTGCAC TGCACTGCCT CTACGGGAAC GTTCAACATC CCTCCTCCTT
GTGGTCCTCC TTCTCGTTTT CGCTCTCCTC GTTTCTCGCC CCTTCGAGAC GATCATTGGC
GTGCGCGATG CTGGCGTGTA TGCCAACGCC GGGTTCATTA TGGCGCGCAC CGGTTCACTT
ACCTTCACCG ATTCAGTGGT GGCGCAGATC GCCGCCGATC AGCAGTCGTC CGATCCTGAG
ATTGCCGATG CAGCGCGTCA GGCGGAAACC AACTTCCTAG GCGTGCAGAA TCCGCAACGG
TTTATCGCCA CACGCCTGCG CGCTGCCGGC TTCTTCATCG ACCAGGGAGA CCTGGCGCGC
GGGCGCGTCG TGCCGCAGGG GTTCCACCTC TTCCCGGCAT GGATTGGACT GATCGCTGCG
TTCTTCGGAT TGCGCGCCGG ATTACTCGCG CCCGCCGTCA CCGGTCTGCT CGGCATCTGG
AGCGTCGCCA TGCTCACCCG TCGCCTCGCC GGTCCGTGGG TTGCGCTCGT GGCGGCGCTC
TTCCTGACGC TCAATGCCGT GCAGGTCTGG TTCAGCCGCT ATACGACCGC CGAAACCGCC
GCGCAGTTCC TTGTCTTTGC CGGACTCTAC GCCTTTGCCG CCGCATTTGG ACATTCGATG
GCAAACGTTC AACGCACCAT GCTCCTCGCC CTCCTCGCGG GGCTGGCATT CGGGCAACTC
GCGCTGACGC GCATCGAATT CTTCCTGGTT CTCGGTCCGG TGGCGCTCTA CCTGCTGCAC
GCCTGGCTCG CGCGTCGCTG GACGCTGCCG CATACCGCGC TGCTGGCGGG CATCGGAGCG
ATGGTGCTGC ACGCCGGGTT GCACATCGCA TTCCTGGCGC GCGCCTATTT TTTCGATACG
CTCTTCGCCC GCCTCCAGGA TTTTGCCCTT ACTGCGGCAT TCGCCCTGCC ATTCCTCACT
CCAACGCTGC GTCAGGTTTA CCTGTTGCGC CCCTGCTCGC GCCTGACGAT GCAGCCCTGC
CCGCCGATTG CCGGCATGCC GCCGACTGCC GATGCGCCGC TGAACTGGAC CCGTATCGGC
ATCGAGGCGC TGGTCGTTGT CGTGGTTGTG GCCGCCCTGG TCGCAATCCG CCGCCTGAAC
CTGATCGCCC GCGTCTCGCC GCTGCTGGTG CGCCTGAACC GTCCGTTGCG TCTGGTTGCG
GCAATTGCCA TCATCGGGAT CGGTGGATAT GCGTACCTGA TCCGTCCGCA GATTCTGTCG
CTGCCGGTCA TCACCGCGCT CCCCGCATGC CTTGCCCCTG AACAACTGAC GAACCCGCAG
GGGGCATGCC TGACGCTCCA GGGGTATGTC GGCGCGCCAA TTGCGACGCC AGCCTATGTG
GACCCGCTCG CCGCGTGGTT CGACCGCGCC ATTGGCGCTG TGCGCGGGCG CGCCGCTCCG
CCGCTGGACG CCTGTATCGC GCTGCGGCGC TCCACGTTGC CGCCAACTGC CGATGGACGC
ACCATACCGG AGGTTCTTCG AGACGGCTTG CTCGACGAAA CCGACGTTCC GCCTGAGATG
CTGGCAACCC TCCGCGCTTG CGACCGCTAC GTGCTGCGCG ATCTGTTCGG CGCAGCGCAG
GCGAACCTGG TGCGCCTGGG ATGGTATCTT TCGCCGCCGG GCATCGCGCT CGCGCTCATC
GGACTGGCGC TCCTCGCCTA TCGCGCCAAC TCGACCTCCT GGTTGTTTCT GGTCATCGCC
GCTGTTGCGT CGGTCGTGTT CCTGCGACTG ACCTACGGAA CCAGCGACCA GCACTACATC
TATATTATGC GCCGCTACGT GCCGCACGTG TATCCCGCAT TCGCCATCGG CGCCGCCTAC
GCCATCGCTC GCCTCTCGTT CAACGTTCAA CCCTCAACGT TCCACATTCC ACGTTCCACG
TTTTCCCGTC TCATCCTCAC TCTCGTCCTC GTTCTGTTTC TGGTCGTTAC AGGCAGACCG
ATCTACCGTC ACACTGAATA CGCTGGCGCG CTCGATCAGA TCGGCGCTAT GGCAGGGCAG
TTCGATCCCG GCGCAATTGT CCTCATGCGC GGCGGTGCGC CTTCCTTCGC ACAGGCGCGC
GACATCCCCG ACCTGCTTGC CACACCGCTC ACCTTCGCCT TCGGTATTGA CGCATTTGCG
CTAAAAAGCC GCGACCCCGG ACGATACGCG CCGCAACTGG CGCGCTACAT CCGCCGCTGG
CACGACCAGG GACGACCGGT GTATCTGGCA ATCGGCGCGA GTGGCGCAAT TGCGCTACCA
GAGTGGCGAC TCGAACCGGC TGGACGTCTG CACGTCGACC TGCCAGAGTA CGAACAGCCG
ACCGATCACA AGCCCTCCGC CGTTCAGCGC TTCACCCTCG ATTTTGCGCT CTACCGCCTG
TTGCCCGACG AACCGACGCC AGCGGAACCG CCAGCGCTCA CCATCGCGCC CGACGACTAC
GCCTATCAGG TGCGCGGCGT TTACCGCGCC GAACGCATCG GCGACCGCCT GATCGCCTGG
ACCGATGGCG ACGCGATCTT CCGCCTGCCG GCGCCGATGA CTGAGCCGCT CACCATCAGC
GTAACACTTG CTGCCGGCGC GCGCCCCGCA ACGCTGCCCG GCGAAACCTG CCTGTCGCTC
GCCGCCGAAC CTGGCTCCTC GACCGATGAA GCGACGTTTA CCGCGCCGGT CTGCACCGTC
CCCGGCGCCG AACCTCTCAC TATCACACTC AGCGCCGACC CGCGCAATCT GCCGCGCTCA
CCGACCGGAC ATCTCCTGTT GCGGGTGCAA ACCCCACCCT TCATCCCCGC CCGCGACGAT
CCCGCCAGCC ATGATCCGCG CCGACTTGGG GTTCAGATTG TCGCGTTAGC GGTGAGGAGC
GCGCCCATTC GATAA
 
Protein sequence
MTIVILIFIV LTLILIGAII AHALPVACRA DDPLERYFEY ALIGALLNGW LAFTLAQIGA 
FSALLHAAII AVLCLIALRI GRRHAPNTPT TPADVWRRCI AWVRQCTALP LRERSTSLLL
VVLLLVFALL VSRPFETIIG VRDAGVYANA GFIMARTGSL TFTDSVVAQI AADQQSSDPE
IADAARQAET NFLGVQNPQR FIATRLRAAG FFIDQGDLAR GRVVPQGFHL FPAWIGLIAA
FFGLRAGLLA PAVTGLLGIW SVAMLTRRLA GPWVALVAAL FLTLNAVQVW FSRYTTAETA
AQFLVFAGLY AFAAAFGHSM ANVQRTMLLA LLAGLAFGQL ALTRIEFFLV LGPVALYLLH
AWLARRWTLP HTALLAGIGA MVLHAGLHIA FLARAYFFDT LFARLQDFAL TAAFALPFLT
PTLRQVYLLR PCSRLTMQPC PPIAGMPPTA DAPLNWTRIG IEALVVVVVV AALVAIRRLN
LIARVSPLLV RLNRPLRLVA AIAIIGIGGY AYLIRPQILS LPVITALPAC LAPEQLTNPQ
GACLTLQGYV GAPIATPAYV DPLAAWFDRA IGAVRGRAAP PLDACIALRR STLPPTADGR
TIPEVLRDGL LDETDVPPEM LATLRACDRY VLRDLFGAAQ ANLVRLGWYL SPPGIALALI
GLALLAYRAN STSWLFLVIA AVASVVFLRL TYGTSDQHYI YIMRRYVPHV YPAFAIGAAY
AIARLSFNVQ PSTFHIPRST FSRLILTLVL VLFLVVTGRP IYRHTEYAGA LDQIGAMAGQ
FDPGAIVLMR GGAPSFAQAR DIPDLLATPL TFAFGIDAFA LKSRDPGRYA PQLARYIRRW
HDQGRPVYLA IGASGAIALP EWRLEPAGRL HVDLPEYEQP TDHKPSAVQR FTLDFALYRL
LPDEPTPAEP PALTIAPDDY AYQVRGVYRA ERIGDRLIAW TDGDAIFRLP APMTEPLTIS
VTLAAGARPA TLPGETCLSL AAEPGSSTDE ATFTAPVCTV PGAEPLTITL SADPRNLPRS
PTGHLLLRVQ TPPFIPARDD PASHDPRRLG VQIVALAVRS APIR
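
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below illustrates this on the first and last blocks of the gene sequence. It is not the tool that produced this record. The `translate` function is hypothetical, and `START_CODONS` lists only a common subset of the alternative initiation codons, which is an assumption made for brevity.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in TCAG order; translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a common subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a CDS, reading an initiation codon at position 0 as Met."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGACGATTG TCATTCTCAT ATTCATTGTT"))  # MTIVILIFIV
# Last 15 bases, which are in frame because 3180 bases precede them.
print(translate("GCGCCCATTC GATAA"))  # APIR
```

The two outputs match the first ten and last four residues of the protein sequence.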