Gene Rcas_3886 details


Gene Information

Locus tag: Rcas_3886
Symbol:
ID: 5541392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5085330
End bp: 5087090
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 61%
IMG OID: 640895997
Product: membrane protein-like protein
Protein accession: YP_001433940
Protein GI: 156743811
COG category: [S] Function unknown
COG ID: [COG3463] Predicted membrane protein
TIGRFAM ID:
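
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts one stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 5085330
END_BP = 5087090
GENE_LENGTH_BP = 1761
PROTEIN_LENGTH_AA = 586


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_length: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(coding_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)   # True
```

Under these assumptions the three fields agree: 5087090 - 5085330 + 1 = 1761 bp, and 3 x (586 + 1) = 1761 bp.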


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.920923
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCGGC ACGAAAGCCT TCGTGGATCG TCCGGCAAGG CACGAGTTTT GCACAATCTA 
GGGTGCGACA TAAATGTCGC GGCACGAAAG CCTTCGTGGA TCGTGTGGCA AGGCACGAGT
TTCGCACATA TGACACAAGC GCTTGCATCA CAACGCGAAC GTTCCAACAC CTGGATCGCG
CACATGCTCA CCGCAATCGA GCGGCAGGCG AGTCTGGCGC TGGCAGCGTT TATCATGCTC
TACGTCGCAG TCCTTTTCTT TGGACTGACA TTCAAGTACT TCAGTTGGGC GCAGGGGTAT
GATCAGATTG ATTATCAGCA GTCGATCTGG AACACAACCC AGGGTCGTTT CCTCGAGATT
TCGCACTACC GCCACACCGA TCACCTGTGG GGCATGGATT TCATTCCGGC GATCCTGCTG
ATCGTTCCAT TCTATGCGCT CTTTCCATCG GCGCTAACGC TCAATTTCTT CCAGGCGCTC
TTTATGGCGC TTGGCGCGCT GCCGATCTAT GGGATCGCGC GCGACCGCTT CGATGGTTCG
CGCCTCGCGG GTCTCGGATG GGCGCTTGTT TATCTGCTCT ATCCATCGCT CTGGTTTGTG
ACGATGAGCG CACCATGGCA ACCGCGCACA CTGGCGATCC CGGCGCTGCT CGGCGCGTTT
TTCTTCCTGC AACGCGCCAC GTTCCAGCGC GCAGGCATCC GGGAGTGGGG CGCATATCTG
GGATTACTGG CACTGGCGCT GACCACACGC ACCGATGTGT CGCTGGTGGT GGTCTCGTTC
GGCATTCTGG CGGCGCTGTG GCGCGTCGGC TGGCGCTGGG CGCTCCCGCC GCTCATGATG
GGACTGGCAT GGTTCTATCT TTCGACGAAT GTGATCGTGC CATCGTTTTA CCTTCCCGAT
TATCAGGTGC GCGAGGGAGA GATCGGCGCG GTGACGGAGG GCGATTACAC CGAGGTCTGG
CCCGGCAAAA GCCCGCAACT GGCGTACTAT TCGCATCTCG GCGACAGTGC GGGCGCTATT
CTGTGGACGA TTGTCAGCCG ACCGATTGAT GTGCTGCGTC TGATGTTTAC ACCGGATAAG
TTGGTTTATC TGGCGCTGAT GCTGCTTCCG CTGGCGCTGC TCCCGTTGCT CGCGCCCGAT
GTCGCGATTC TGGCAGCCCC GCCGCTGGCA ATGAACCTCC TCTCGCTGCG CCCGTTTCAG
ATCACGGTGC GTGAACAGTA TCAGGCGCTT GTCATTCCCG GTCTGATTCT GGCGGCGATT
ATCGGCGCGG CGCGGGTATG GTCGTGGTGG CAGGCGCGCA GTGAGCGGAC GCGCGGCGCT
CACGTTTCTG GCATTCAGGA ACGCCGAACA CGCTCGCGCC CGGCAGCAAT GTTCATGATC
GGCGCCATTG GCATTGCAGC GGTCACCAAT CTGGCATACC GCAATCCCGT CGCAACGACC
GTGCTCTACC GTGAGTCGCC GGAGCGTCTT GCGGCGATGG AGCGCCTGGC AGCGTTGATC
CCGCCCGATG CTCCGCTGGG GGTGACGTCG TTTCTGGCGC CGCGCATGAT GCCGCGTCGC
TTCATCTACT ATGTGCCGCC GGACGAGTCA TTTCCACCGC TCGAGCGCGC CGAGTATCTT
TTCATCGACA TGCGCGCTGC GGCGTTGCAC ACCGAACGCG ACCGCGACTT TATTCAGCGT
CTGCGCGAAT CGAGGCGCTG GCAGGTTATT GCCGAAGAGG AGGACCTGCT GGTGCTGCGC
CAGGCGCACC AGGCGCCGTA G
 
Protein sequence
MSRHESLRGS SGKARVLHNL GCDINVAARK PSWIVWQGTS FAHMTQALAS QRERSNTWIA 
HMLTAIERQA SLALAAFIML YVAVLFFGLT FKYFSWAQGY DQIDYQQSIW NTTQGRFLEI
SHYRHTDHLW GMDFIPAILL IVPFYALFPS ALTLNFFQAL FMALGALPIY GIARDRFDGS
RLAGLGWALV YLLYPSLWFV TMSAPWQPRT LAIPALLGAF FFLQRATFQR AGIREWGAYL
GLLALALTTR TDVSLVVVSF GILAALWRVG WRWALPPLMM GLAWFYLSTN VIVPSFYLPD
YQVREGEIGA VTEGDYTEVW PGKSPQLAYY SHLGDSAGAI LWTIVSRPID VLRLMFTPDK
LVYLALMLLP LALLPLLAPD VAILAAPPLA MNLLSLRPFQ ITVREQYQAL VIPGLILAAI
IGAARVWSWW QARSERTRGA HVSGIQERRT RSRPAAMFMI GAIGIAAVTN LAYRNPVATT
VLYRESPERL AAMERLAALI PPDAPLGVTS FLAPRMMPRR FIYYVPPDES FPPLERAEYL
FIDMRAAALH TERDRDFIQR LRESRRWQVI AEEEDLLVLR QAHQAP
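
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do that in plain Python. It assumes the sequence is given in the coding orientation, as shown above, and uses the standard amino acid assignments, which translation table 11 shares with table 1 (table 11 differs in which codons may act as start codons). The helper names `clean` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is ATG, as it is in this record. Alternative
    start codons allowed by table 11 (for example GTG or TTG) are not
    converted to M here.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 60 nucleotides of the gene sequence above.
    first_line = "ATGTCGCGGC ACGAAAGCCT TCGTGGATCG TCCGGCAAGG CACGAGTTTT GCACAATCTA"
    print(translate(first_line))  # MSRHESLRGSSGKARVLHNL
```

Pasting the full gene sequence into `translate` should give a string that can be compared with the protein sequence after passing the latter through `clean`.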