Gene Rcas_4356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4356 
Symbol 
ID5541869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5607920 
End bp5610841 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content60% 
IMG OID640896462 
Producthypothetical protein 
Protein accessionYP_001434398 
Protein GI156744269 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0703153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA CATCCTCTCC CGGTTCGTCA CAACCTGTCG TCTCCAGTCT CTCGCTGAGG 
CGCTGGAGCA AATGGTGGTC CCCCGGATCA CTCATCGGCG TGTTGCGCGT CGTCAGCCGA
CGGTTGTGGA GCCACCTGGG GCTGATGCTC GCCATCGCCG TCGGCTTCAT CGTCGCTATC
GGGTTGACGG TCAGTATTCC GGTGTATGCC GAGGCGGTCG GCTATCGTAT CCTGCGTGAT
GAACTGGCTC AGGGGGAAGC AGGCTCCAAA CGACCGCCGT TCGCCTTTAT GTTTCGTTAC
CTCGGCTCGC AAACCGGCGT CATTTCCTGG CGCGACTATG CCCCGCTCGA TGAGTATATG
CGCACCCAAC TGGCGGAACG CCTTGGTCTG CCGATTATGA CAGAGGTGCG CTACGTCGCC
ACGGACAAAG CGCCCCTGAT GCCCGCCGGC GGCGTCGGGA AGCCGCTGAT CTTTGTCAAT
ACCGCCTTCG CCACCGATTT TGAGCAGCAC ATTGATATTA TCGATGGTGC ATTTCCACAA
CCGGCATCGA CGGACGGACC GATTGAGGTG TTGATTTCGG AAGATCTGTC GGGACGGCTT
GGCTTTCAGG TTGGAGAGGA GTACCTCATC CTTGGTCCAC AGGAACAGCG CGTCGATCAG
AGTTACCCGA TCCGTATCGC AGGAGTCTGG CGCGTGCGTG ATTCCGGCAG CGACTACTGG
TTCTACGATC CGGTGACTCT CTACGATACA CTCTTTGTGC CGGAAGAGAG TTTTCGTGAC
CGTCTGACAG CGATTAATCC AAAACCGATC TATGTTGCTA CGTGGTATGC GATTGCCGAC
GGGAGCAGTG TGCGTTCTTC GGATGTCGCT GATGTGCGCG CGCGCATCAA TCGCATCGCC
ACCGATATCA CCACCATTTT GCCCGGATCG CGCATCGACA TCTCGCCAGC GCAGGCATTG
GCAGAGCATC AGCGCCAGGT ACAGCGGCTG ACCCTCATTC TGACGATCTT TAGCGTTCCG
GTGCTTGGTC TGATTGCGTA TTTCATTCTG CTGGTCGCCG GTTTGGTCGT GCAGCGCCAG
AGCAACGAGA TTGCTGTATT GCGCAGCCGT GGCGCTTCGC GCCTTCAGGT GCTCGGTATC
TACCTGATCG AGGGCTTGCT GCTCGGCATC GCCGCACTGG CGGCGGGGGT CGCCGTGGGG
CAGGGCGCCG GGTTGCTGAT GACCTGGACG CGCTCGTTCC TCGATGTCCA ACCGGGTGAG
TGGTTGCCGA TTGAACTTAC TCCCGACGCC TGGCAACGCG CCTGGCAAAT GCTGATAGTG
ATGGTGCCTG CAAGCCTGTT GCCTGCGTTC GGAGTCGCCC GCTATACGAT TGTGTCGTTC
AAGAGCGAAC GCGCGCGGGC GACGCGCAAA CCGTTCTGGC AGCGCATGTA CCTCGATCTG
CTCTTGCTGC TGCCGGTCTA TTATGGTTAT ACGCTCCTGG AACAGCGTGG CACAGTGGCA
TTCCTTGGCG CGGCTGATGA TCCGTTTGGC AATCCGCTGT TACTGCTGGC GCCGACCCTC
TACATGTTCA CGCTGGCGCT GGTGGCGACG CGCATCTTTC CGCTGGCGAT GAGCGCACTC
GAGTGGCTGG CGCGGCATAT GAGCGGTGTG GCGACGGTGA CGGCGCTCCG GTATCTGGCG
CGTACTCCCG GCGCCTACAC CGGTCCGGTG TTGCTGCTGA TCCTCACCCT CAGCCTTGCA
ACATTTACCG CGTCTATGGC GCAAACGCTC GACCGCCATC TGATCGATCA GGTGTACTAC
GAATCCGGCA GCGACATTCG GTTGTACGAT CTGGGACAGA GCGGCGGTTT CTCCGGTCCT
ATGGCGGGGA TGCAACCGCA ACAGCCGCTG GTGTCCGATG GCATGCAGGA GGCGCGCTTC
ATGTTCCTGC CGGTCACCGA TTACCTGACC ATCCCCGGCG TCGAGGCGGC GACGCGCGTA
TCGGTCAGTC AGGTTGAGAT CACTCTGGCC AATCGCACCA TCCCTGCTCG TTTCATCGGT
GTTGACCGGG TGGACCTGCC CGCCGTCATT CACTGGCGCG CCGATTATGC CGGTGAGTCA
CTCGGCGCGC TGATGAACCG TCTGGCGGAC GACCCTTCCG CCGTGCTGGT CAATAGCGCG
TTTGCGGCGC AAAACCGTCT GCGTCCCGGC GACCGCTTTG AGGTGGTCAT GAACGACCTC
GACCGGAAGG TTCGTGTGCC GGTGATCGTT GTCGGGTATG TGAACCTCTT CCCGACGGTC
TATCCAACTG ATGGTCCGTT CTTGATCGGA AATCTCGACT ATGCCTTCGA TATGCAGGGC
GGGCAATATC CCTACGATGT CTGGCTCCGG TTGGCGCCGG GTGTTGAGCG CCAGACGATT
GACGAAGGGC TGCGCGAACT GGGGTTGCGC ACCTTCGAGC GTGGCTTTGC GCCGACGATC
ATCGTCGCCG AGCATGCGCG CCCCGAACGA CAGGGTTTCT ACGGCTTGCT GTCAGTCGGG
TTCATCGCCT CGGCATTTCT GACGGTGCTG GGGTTCTTGT TCTATTCAGC GCTCTCATTC
CAGCGCCGGT TCGTCGAACT CGGTATGCTG CGCGCCATCG GGCTTTCGAC CCGGCAACTC
GGCGCGTTGC TGGCGTGGGA GCAGGCGCTG ATTATCGGCG CCGGCATGAT TGGCGGCACG
CTGATCGGCG TCACTGCCAG TCAGTTGTTT ATCCCGTTTT TACAGGTCCG TCGTGGCGCC
AACGCGCAAA TCCCGCCATT TGTCGTCCAG ATCGCGTGGG AGCAGATCGC CATTATCTAC
ATGGTCTTTG GCGCGATGCT GATTGCGGCT GTGCTGATTA CCATTGCTCT GCTCCGGCGC
ATGAAACTGT TCCAGGCAGT CAAATTGGGA GAAGCGATCT GA
 
Protein sequence
MATTSSPGSS QPVVSSLSLR RWSKWWSPGS LIGVLRVVSR RLWSHLGLML AIAVGFIVAI 
GLTVSIPVYA EAVGYRILRD ELAQGEAGSK RPPFAFMFRY LGSQTGVISW RDYAPLDEYM
RTQLAERLGL PIMTEVRYVA TDKAPLMPAG GVGKPLIFVN TAFATDFEQH IDIIDGAFPQ
PASTDGPIEV LISEDLSGRL GFQVGEEYLI LGPQEQRVDQ SYPIRIAGVW RVRDSGSDYW
FYDPVTLYDT LFVPEESFRD RLTAINPKPI YVATWYAIAD GSSVRSSDVA DVRARINRIA
TDITTILPGS RIDISPAQAL AEHQRQVQRL TLILTIFSVP VLGLIAYFIL LVAGLVVQRQ
SNEIAVLRSR GASRLQVLGI YLIEGLLLGI AALAAGVAVG QGAGLLMTWT RSFLDVQPGE
WLPIELTPDA WQRAWQMLIV MVPASLLPAF GVARYTIVSF KSERARATRK PFWQRMYLDL
LLLLPVYYGY TLLEQRGTVA FLGAADDPFG NPLLLLAPTL YMFTLALVAT RIFPLAMSAL
EWLARHMSGV ATVTALRYLA RTPGAYTGPV LLLILTLSLA TFTASMAQTL DRHLIDQVYY
ESGSDIRLYD LGQSGGFSGP MAGMQPQQPL VSDGMQEARF MFLPVTDYLT IPGVEAATRV
SVSQVEITLA NRTIPARFIG VDRVDLPAVI HWRADYAGES LGALMNRLAD DPSAVLVNSA
FAAQNRLRPG DRFEVVMNDL DRKVRVPVIV VGYVNLFPTV YPTDGPFLIG NLDYAFDMQG
GQYPYDVWLR LAPGVERQTI DEGLRELGLR TFERGFAPTI IVAEHARPER QGFYGLLSVG
FIASAFLTVL GFLFYSALSF QRRFVELGML RAIGLSTRQL GALLAWEQAL IIGAGMIGGT
LIGVTASQLF IPFLQVRRGA NAQIPPFVVQ IAWEQIAIIY MVFGAMLIAA VLITIALLRR
MKLFQAVKLG EAI