Gene Rcas_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0531 
Symbol 
ID5537994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp695660 
End bp698479 
Gene Length2820 bp 
Protein Length939 aa 
Translation table11 
GC content63% 
IMG OID640892693 
Productuncharacterized membrane protein-like protein 
Protein accessionYP_001430679 
Protein GI156740550 
COG category[S] Function unknown 
COG ID[COG5427] Uncharacterized membrane protein 
TIGRFAM ID[TIGR03662] Chlor_Arch_YYY domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACTG CCGTTCTCAC ATACTGGTTC GCTGCTCTCG CGTTCGGCAT CGTTGGTATG 
CCAGCAGCAC GATTGCTTTT TGGCGCACTT CCCGACCGCG GTTACGCCTT CGCCCGGACG
GTAGGGTTGC TGCTGACCGG CTACCTCGCC TGGCTGGCTG CCATGTTCGG ATTGGCGTCC
TTCGGCGCGC CGCTTATCGT CGTGGCAGCG CTGGCAGTGG TGATCGGCGG AGTGCTGGCG
TTGCGCCGCG TGTCAGCGAA AGCAGCGTCG CCTGCGGTTG CCCTGCGCTG GGTGCGTCGC
AACTGGGGCA TAGTCGTGTT TTACGAGGCG CTTTTCGCCG CCGCGCTCAT CTTTCTGGCG
TTGCTGCGAG CACAGGAATA CGGCTTCGTC GGTCCGCACC CCTGGGGCAC CGAACGACCG
ATGGACTACG CCTTCTTCAA CGCCATACGT GTCAGCGCCG TCTTCCCGCC GCACGATCCC
TGGATGTCGG GCTACAGCAT CAACTACTAC TACTTCGGAT ATCTCCTCAT GGCGGCGGTC
TCGCTCGTTA CCGGAGTGCA TCCCGCAGTT GGATACAACA TGTCGCTGGC GTTGACCTTC
GCCCTTACCG CGCTCGGCAT TGCAGGGCTG ATCACCAACC TGGTTGCGCT GGCGACATCG
CAATTACCTG ATGCCCCGCG CGCGACAGAC GTGGAAGTCG AACGTGTGGA TGCGGACGCA
ACGCCGTATT CGGACGGCAT GGCATCCCTC AGTCGCATGC AGGCGGCGTT CGCCGTGCCG
CTCGACCGTG GCGCTTCCTG GTTTTGGGGC ATGCGCCGCG CCAGCACGCA GCAACAGATG
ACGGCACAGC AAGCATCGAC GGAACTTGCG GAACAATCGC CTTCGACTCA GGCGTCCGCC
GATGTCGAAG CAGACATGCC CGACGGACAG GCGCTCCCTA ATAGCGAAGG CGAACGCGCG
CCGGAAATCG ACGGCGCACC GCCGGATGCG CATCGTTGCA TCCCGTGGCA TGGTTGGCTC
GCTGCGCTGC TCACAGTCGT TGCGGTGCTC CTGGCAGGCA ATCAGGCAGG CACATTGCAG
GTTATCGTCG GGAACGAACG CATCGTGGCG CTCGACGGCG CACAACTGGC TGCGGCGCTC
GTGCAGGCGT TGAGCGGCGC CGAAACGATC ACCCTTCCGT ATCCGGCGCG CACCGGCGAT
TTCAATGTGT TCGACACGCT CATCCGGGAA GATCGGATGC GTGACTTTAA CTGGTGGTGG
CCCTCACGGG CGGTGTGGGA CGAGCGCCCT GTCTGGAACC CGGAAACACA GCAGATCGAA
CCTGTGCGCG GCTATGCCAT CACCGAGTTC CCCTTCTTTT CGTTCTGGCT CGGCGATATG
CACCCCCACG TCATGGCGCT GCCGTTTGGC GTACTGGCGC TGGCACTGGC GCTGGCGCTG
CTGGCGAGTT CGGCGCCGCT GCGCCTGTGG CGCAACCGCG CCGAACTTCT GCTGTCCGGT
GTGATTCTTG GCAGCCTGTA TATGATCAAC AGTTGGGACC TGCCGACCTA TCTGCTGCTT
TTCCTGGGAG TGCTGGCGCT CAAAACCGCG ACGCAGCCCG AAACGCCAGC GATCTCCGGC
GAGGCTGCAT CTGGTGTGCC GCTCATCGAC CGACTCGCGC CGCGCTGGCG CGGCTATCTG
ATAAACGCCT TGCTGATTCT CGCAACCAGC GTCGTTCTGA TCGCACCATT TCTGCTCACT
TTCACATCCC TGATTGGCGG GCGTGCACCG CTCATCGATC TGCCGCTGAT CGGCGAGATG
ACCCGCATCC TCGGGTTTGT CACCGGCAAA ACCGGTCTGC ACAGTTTTCT CATCATCTTC
GGCACATTCC TGGCGCCGCT CATCGGTCTG ACGGCGGCGC TGGCGCGCGG CACAGCGCGC
ACACTTCTGA TAGCGTCTGG CGTCACCGTC GTAATCGGCG CCTTCATCGG CTTCCCACTC
GCAGCGCTGC TTCCGCTGGG TCTGGCGGCA ACGCTGATCG CCGGGCAACG CGCCGGGCAT
CCGGCGGATG CGTTCGCGCT GGGGACGCTG GCGCTGGGCA GTGCGATCTG CCTCGGCGTC
GAACTGGTCT ACATTCGCGA CGTGTTCGAG AATCGGATGA ACACCATCTT CAAGTTCTAC
TATCAGACAT GGCTCATCTG GGGCGTGGCA GGGGGCTACG CCGCCTGGCG GCTAATGCAG
ATCGTCGGAA TATGCTGGCG GGCGCAACCC CGCAGGAGCG CCAGGATTGC TGCAATGCTG
GCAATGCCGT TGGTTGCGCT GATTCTTCTG GCAAGTGGGT TGACCTACCC ATGGCTGACG
GCGGGCAAAG CCTTCGCCGA AGGGCGACAC GTCGGACTTG AAGGGCGCAC GCCGCGCGAA
CGGACGCCAG AAGGCGCCGA AGCGATCGCC TGGGTGCGCG CCAACACCCC CGGCGACGCC
GTCATCCTCG AAGCTGTAGG ACCCTCCTAC GACACCGCAG GCATCGGCTA CGGCGGCGTC
TCGTCGAGCA CCGGACGGGC GACCGTGATG GGATGGGAAG GGCATCAGCA GCAATGGCGC
GGCGGCGACC CAAAGGTGTT GGCGGAGATT GCGCCACGCG CCACCGACGT TGCGACAATC
TACAGCACTG CCGACACTGC GCTGGTGCGC GCATTGCTGG CAATCTATGG CGTCGACTAC
ATCTATGTCG GCGAAGCCGA GCGTCAGACA TATCCGGCGG AAGGGCTGGC GAAACTCTCC
ACGCTGGGAG ATGTCGTCTT CCAGAACGAC GAAGTGACGA TCTACCGCGT CCGTCCATGA
 
Protein sequence
MLTAVLTYWF AALAFGIVGM PAARLLFGAL PDRGYAFART VGLLLTGYLA WLAAMFGLAS 
FGAPLIVVAA LAVVIGGVLA LRRVSAKAAS PAVALRWVRR NWGIVVFYEA LFAAALIFLA
LLRAQEYGFV GPHPWGTERP MDYAFFNAIR VSAVFPPHDP WMSGYSINYY YFGYLLMAAV
SLVTGVHPAV GYNMSLALTF ALTALGIAGL ITNLVALATS QLPDAPRATD VEVERVDADA
TPYSDGMASL SRMQAAFAVP LDRGASWFWG MRRASTQQQM TAQQASTELA EQSPSTQASA
DVEADMPDGQ ALPNSEGERA PEIDGAPPDA HRCIPWHGWL AALLTVVAVL LAGNQAGTLQ
VIVGNERIVA LDGAQLAAAL VQALSGAETI TLPYPARTGD FNVFDTLIRE DRMRDFNWWW
PSRAVWDERP VWNPETQQIE PVRGYAITEF PFFSFWLGDM HPHVMALPFG VLALALALAL
LASSAPLRLW RNRAELLLSG VILGSLYMIN SWDLPTYLLL FLGVLALKTA TQPETPAISG
EAASGVPLID RLAPRWRGYL INALLILATS VVLIAPFLLT FTSLIGGRAP LIDLPLIGEM
TRILGFVTGK TGLHSFLIIF GTFLAPLIGL TAALARGTAR TLLIASGVTV VIGAFIGFPL
AALLPLGLAA TLIAGQRAGH PADAFALGTL ALGSAICLGV ELVYIRDVFE NRMNTIFKFY
YQTWLIWGVA GGYAAWRLMQ IVGICWRAQP RRSARIAAML AMPLVALILL ASGLTYPWLT
AGKAFAEGRH VGLEGRTPRE RTPEGAEAIA WVRANTPGDA VILEAVGPSY DTAGIGYGGV
SSSTGRATVM GWEGHQQQWR GGDPKVLAEI APRATDVATI YSTADTALVR ALLAIYGVDY
IYVGEAERQT YPAEGLAKLS TLGDVVFQND EVTIYRVRP