Gene Rcas_3151 details


Gene Information

Locus tag: Rcas_3151
Symbol:
ID: 5540649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4086301
End bp: 4087902
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 62%
IMG OID: 640895272
Product: secretion protein HlyD family protein
Protein accession: YP_001433223
Protein GI: 156743094
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID:
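
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against one another. The sketch below is only a sanity check, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4086301
end_bp = 4087902

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon (3 nt) that is not translated.
protein_length = (gene_length - 3) // 3

print(gene_length)     # 1602
print(protein_length)  # 533
```

Both results agree with the listed Gene Length (1602 bp) and Protein Length (533 aa).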


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATGGA TAACGCGCTG GATGAGCATA TGCTGTGTGG TATTGCTGAC CGCCTGTAGC 
ACAACGCAGG CATTGCAGGA GCCGCCAACT CCTACCCCCC TGCCGCCCGA TCCGGCGCTC
GAACGCCCCA CGTACACGGT TCAGCGCGGT GCTATCGAGC AGGTGTTCAC GGTCACTGCC
CGCGCAACGC CGGTCGATAT GGCGCGGCTG GCATTCCGAC GTGATGGGCG CGTCAACGTC
GTGAATGTCA GTCGTGGCGA TGTCGTCAAG GAAGGCGATG TTTTGGCGGA ACTTCAGCAG
GAGGAGGCGC TCGACGAGTT GCGTCGCGCC GAGGACGATC TGGCGCAGGC GCGACGCGAC
CTGGAGAGTG CGCAGCTGGC AAAGGAGAAA CGGATCAAGG AACGCGAACT CGATGTGGAT
CGCGCCAGGC GCGAACTCGA ACGATTGTTG CCTGGCGGTG AGGCGGATCT GTTCAAGGAA
TTGCAGGAAC GCCTCGAAGC GGCGCAGCGC GAGTTGCGCA CCACACGCGA TGATGCGTCG
TGGTCGAAAA CTTCAGCGGA CGAAGCGCTA CGCGACAGCG CCGAAGCGCT TTCCGACACC
CAGAAGGCGT ATAGTGTCGC CTACTGGAAC TGGGACTGGG TGCAGCGCTA TGGGACCGAC
CCGGAGAATC CATTTATCAA GAATGAGGCC GGGGTGCTGG TTCCCAATCG CCTCACCGAG
AAACAAAAGG AAGAGTTTCG TATGAAACTG GTGCAGGCGG AACGCGCATT ACGCGACGCC
GAACGCTCGG TCGAGCAGGC GCAGCGCGCC CGTGATCGCG CATATGAAGA TGAAGTCGTC
AAAATTTACG AGGCGGAGAA GAAAGTCGAG GAAGCGCAGC GCGCGCTTGA GACCCTGCTC
CAGGGCAAGA ACAAAGCAAT CGAAGACGCC CAACTCGCTC TCGAGCGGGC GCAGGTTGCA
CTGGAGGAAG CGCGCAACGA GACGCTCAAC AGCGCACTCC GCGCCGTGGA GAACGCCGAG
CGCGCCCTGG AAAAAGCACG CCGCCGCGTC GATGACGGGC GTGTCATAGC GCCGCAGGAT
GGAACCGTGC TGGCGGTCGC TATCGAGCCA GGAGCAACAG TGACCGCCTT CGAGCCGGTG
ATCGAGATTG CCGATCCATC GAACCTGGAG TTTGCCGCAA CTCTGAGCGC GCAGCAGATG
CGGCTGCTCT CCGAAGGTCA GAGTGTCGAG ATCCGCCTGC TATCGCGCCC CGACCTGGTG
ATCCCCGGCG TTATTCGACG CATGCCGGCG CCTTACGGGT CGGGCGGAAG CGGCGCCGTC
CAGGATCGCG ACGTCACCAC GCGCTTTCAA ATAGTCGATG CGCGCGGACA AACATTCGAG
GCTGGCGTGA CCGTTGCGCG AGTCAGCATC GTGCTGGAGC GCAAAGACAA TGTCCTCTGG
CTGCCGCCAG AGGCGATCCG GTCATTTGAG GGTCGCCGCT TCGTCATTGT CCGTGAAGGG
GAGCGTGAAC GCCGTGTGAC CGTGCGGGTC GGCATCGAAA CCGATGAGCG GGTCGAGATT
CTCGAGGGCC TGAACGAAGG CGATATTGTC GTTGGATCCT GA
 
Protein sequence
MRWITRWMSI CCVVLLTACS TTQALQEPPT PTPLPPDPAL ERPTYTVQRG AIEQVFTVTA 
RATPVDMARL AFRRDGRVNV VNVSRGDVVK EGDVLAELQQ EEALDELRRA EDDLAQARRD
LESAQLAKEK RIKERELDVD RARRELERLL PGGEADLFKE LQERLEAAQR ELRTTRDDAS
WSKTSADEAL RDSAEALSDT QKAYSVAYWN WDWVQRYGTD PENPFIKNEA GVLVPNRLTE
KQKEEFRMKL VQAERALRDA ERSVEQAQRA RDRAYEDEVV KIYEAEKKVE EAQRALETLL
QGKNKAIEDA QLALERAQVA LEEARNETLN SALRAVENAE RALEKARRRV DDGRVIAPQD
GTVLAVAIEP GATVTAFEPV IEIADPSNLE FAATLSAQQM RLLSEGQSVE IRLLSRPDLV
IPGVIRRMPA PYGSGGSGAV QDRDVTTRFQ IVDARGQTFE AGVTVARVSI VLERKDNVLW
LPPEAIRSFE GRRFVIVREG ERERRVTVRV GIETDERVEI LEGLNEGDIV VGS
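
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the tool used to produce this record: the `translate` helper and the table-building code are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation (the tables differ only in permitted start codons). Only the first 30 and last 12 nucleotides of the gene sequence are used here.

```python
# Standard codon assignments, ordered by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-nt blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the coding region
            break
        protein.append(amino_acid)
    return "".join(protein)


head = "ATGCGATGGA TAACGCGCTG GATGAGCATA"  # first 30 nt of the gene sequence
tail = "GTTGGATCCT GA"                     # last 12 nt, ending in the stop codon

print(translate(head))  # MRWITRWMSI
print(translate(tail))  # VGS
```

The two outputs match the first ten and last three residues of the protein sequence listed above.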