Gene Rcas_1973 details

Gene Information

Locus tag: Rcas_1973
Symbol:
ID: 5539451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2525747
End bp: 2527591
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 65%
IMG OID: 640894108
Product: hypothetical protein
Protein accession: YP_001432079
Protein GI: 156741950
COG category:
COG ID:
TIGRFAM ID: [TIGR02226] N-terminal double-transmembrane domain
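
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the stated gene length is consistent with that reading) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 2525747
END_BP = 2527591


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1845
print(protein_length(length_bp))  # 614
```

Both results agree with the Gene Length (1845 bp) and Protein Length (614 aa) fields in the record.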


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTTC TTCTGCCACT CGGATTACTG GCATTGCTTG CCCTGCCGCT CATCGTGCTG 
CTCCATTTCC TGCGCGAACG GCGGCGACGC GTGCCAACAC CAAGCCTGCT GCTCTGGGCA
AACCTGCCAC GCCGCGTGGA AGGCGAGCGC AGCCGCCGCC TGCCGCTGAC CCTGCTGCTG
CTCCTCCACC TCCTGATTGC CACGCTGCTC GGCGTTGCGC TGGGGGGACC GCAGATCACC
GGTGCGCTCA CGCCCGACGC GCGCCATACC GCCATTATTC TCGACACATC CACCAGCATG
GCAGCCGTTG ACGGCGGCGC GAGCCGTTTC GACCAGGCGC GCCGGCGCGC ACGCGCTATT
GTCACCTCTG CCTCTCCCGG TGACCGGATC ACGCTGATCG CTGCCGGACC GCGGGCGCAG
ATCGTGGCAT CCGGCGACGA CCCCTTACTG ATCACTGCCG CGCTCGACCG CCTTCAGCCC
GGCGGCGTTG GCATGGCGAT CAATGAGGCG TTGACGCTGG CAGAAGCCGT GCTCGACCCA
CAGTTCAGCC GACGGATCGT GGTCCTGACC GATAGCGCGT TACCGCCGCA ACCTGCGCGC
GATATGATTG TGCCCATCGA ATGGGTGTCG ATAGGATCGA ATGTGCCGAA CCGCGCGATC
ATTGCGTTTG CCAGCCGTCC CTGGGGCGGT CGCCTCCAGG TGTATGCGCG GGTCGCCAAT
TATGACGCCA CAGCCTTCAA TGGAACGCTC CAGGTCTTCT CCGACAATCA GGTCGTTGCA
GAAGAACGAG TCGCCATTGC GCCGAACGGC GAAACGGAGG TCAGTTGGAC GCTGCCCGGC
GGAATTGAGG CGCTGCGCGC CACCATCGAT GGGCGCGATG CGCTGCCACA GGACGATGTC
GCATACCTCA GCGTGTCGCA GGGGCGCCCG ATTCTGGCGT TGCTGGTGTC GAACGAGCCG
GCCGCCCTTC GCCGCGCGCT CGCAGCCATA CCCGGCGTGA CGGTCGTTGT GACGAACCCT
GCCGCCTACG CGGACACGCC GGAGCGATCT GCCGCAGACC TGACAATCTT CGATGGTTTT
CTGCCGGATG CCTGGCCCCA GGGCGCTATT CTGTCAATCG CCCCACCCTC TGGATCGTCG
CTGTTGAATG TAGCGTCCGA CACACGCGAA CCAGAACCGG GCAAGCCGTT GCACCAACGA
GGGAATACGC TCCAGGGGAT CGAGTTCGGC GGCGTGGTGT TTGGCGCCGT CCGCATCGTC
GAGGCGCCAC CATGGGCTGA GGTGCAGTTG TCGTTCGAGA ATACGCCGCT GATCCTCCGA
GGACGAACCG ACAATCACGA AATTGCGATC TGGACGTTCA ATCTTGCCAG CAGCAACCTG
ACGACACGCC TGGCATTTCC GATCCTGGTT GCGCGCACCG TGCGCGACCT GGCGCCACCG
CCGTTGCCGC AGGCGGTGCG CGCCGGCGAG CCACTGGTCA TCCGACCCGA CCCGCGCACG
ACAACCCTGC GACTGCGTGG TCCTGACAAC CGGCAGATTA CCGCGCCGGC AGCATCGGTT
GTCACCCTCG ATACGCTGAT CGAGCCGGGG TTGTACCGCG TGGAAGAACA ACGCAACAAT
ATCACCGTTC CGGTTGGCAT GGTCGGAGTC AATGCAGGAG CGGCAATCGA ATCAAACCTG
CGCCCACAGA ACGCACCGCC GTTGCGTGCG CCGGGAACCG ACCCCGGCAG CGCAGCGGGA
CGACAGACGC TCGATCTATG GCCCTGGCTG GCGCTGGCTG CGCTCCTGGT TCTGGCGCTG
GAATGGGCGT ATGTGTTGCG CCGACGCGAG AAAGTGTTCA CATGA
 
Protein sequence
MSFLLPLGLL ALLALPLIVL LHFLRERRRR VPTPSLLLWA NLPRRVEGER SRRLPLTLLL 
LLHLLIATLL GVALGGPQIT GALTPDARHT AIILDTSTSM AAVDGGASRF DQARRRARAI
VTSASPGDRI TLIAAGPRAQ IVASGDDPLL ITAALDRLQP GGVGMAINEA LTLAEAVLDP
QFSRRIVVLT DSALPPQPAR DMIVPIEWVS IGSNVPNRAI IAFASRPWGG RLQVYARVAN
YDATAFNGTL QVFSDNQVVA EERVAIAPNG ETEVSWTLPG GIEALRATID GRDALPQDDV
AYLSVSQGRP ILALLVSNEP AALRRALAAI PGVTVVVTNP AAYADTPERS AADLTIFDGF
LPDAWPQGAI LSIAPPSGSS LLNVASDTRE PEPGKPLHQR GNTLQGIEFG GVVFGAVRIV
EAPPWAEVQL SFENTPLILR GRTDNHEIAI WTFNLASSNL TTRLAFPILV ARTVRDLAPP
PLPQAVRAGE PLVIRPDPRT TTLRLRGPDN RQITAPAASV VTLDTLIEPG LYRVEEQRNN
ITVPVGMVGV NAGAAIESNL RPQNAPPLRA PGTDPGSAAG RQTLDLWPWL ALAALLVLAL
EWAYVLRRRE KVFT
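
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is the standard sense-codon assignment that translation table 11 shares with the standard code, and alternative start codons permitted by table 11 are not handled (this gene starts with ATG, so that does not matter here).

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGAGTTTTC TTCTGCCACT CGGATTACTG"))  # MSFLLPLGLL
# The last line of the gene sequence is in frame and ends with the TGA stop.
print(translate("GAATGGGCGT ATGTGTTGCG CCGACGCGAG AAAGTGTTCA CATGA"))  # EWAYVLRRREKVFT
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein sequence and the GC content field to be checked in the same way.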