Gene Rcas_0362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0362 
Symbol 
ID5537824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp447823 
End bp451050 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content61% 
IMG OID640892525 
ProductTPR repeat-containing protein 
Protein accessionYP_001430512 
Protein GI156740383 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.574464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.999715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCACGTT CGTCTGTTTC GATCTGGTGT GAGCGCGTCA TCGAGGGTGG CTGGCTGCTT 
GCCCTTTTGC TGATACCAAC CTACTTCAAT TTGTTGTCGG CGCGTCACTT CGAGCCGGAC
AAAGCAACGA CGTTGCGCTC GATTGTCATG GTCATGCTGG CGGCTGCGCT GATCGATGCG
CTCGAGCGTT TTGCAAATCG GCGCGTTACC GTAGAAGACG CATCCAAACC CCCTTGGTGG
CGGCGCATCG CTTCCATACC GCTTGCTGTG CCGACCAGTA TGTATGCGGC AGTCTTTCTC
CTGGCGACCG CAGTGTCAAT CGCGCCGTCG GTCAGTTTCT GGGGATCGTA CCAGCGCCTT
CAGGGAACCT TCACCAACCT GTCGTACATC GGACTGGGCG TGCTCATTGC CTTCTATCTC
CGTCGTCGTG AGCAGATTGA TCGCCTGGTG ACGATCATGA TCCTCGCCGG TCTGATCGCT
GCATGGTACG GTCTTGTGCA GCATGTGCAA CTCGATCCGC TTCCCTGGCG CGGCGATGTC
GTGTCGCGTG TGGCATCCAC GATGGGGAAC TCGATTTTTG TGGCAGCCTA TATGATTATG
GTGCTGCCCT ATGCGCTCTA CCGTGGAGTC GTCGCTCTCT ATCAGGCGCG GAACGCTGAT
GCTTTGCCCT CCCGTCCGTC CATCGATCTG GGATGGGGGG CGGCATATGT CCTCCTGGCG
CTCTCTGCTC TGGCGCTGGT GTTTGCGGCG ATGATGTTCG GCGCCGTCGT GCGCGCCGCC
GATCTGCGCT ACTGGTGGGT TTACCCCGGT GCGCTGATTA TCGCCGGCGG GTTGTTCCTG
CTGATTGCGC TGGCGCCGCA TCGCGCCGAA CGCCTGACCT TCCGGACACT CATTCCGGGC
ATTCTGCTGC TCGCGTATGT CGTTTTCCTC GGTGTGTCGT ATGCTCTGGG GCAGGGACCG
AATCAGCGCG TGGTTCCGCT CGAAGGTCGT CCTGGCGTCG AATGGCCCCT CTGGTTGGGC
ATCGCTGTGG CGCTCATGGC GTCCGGGTAT GCGCTGGTGA TCCTGCTTCC CCGGCGCGCA
AACGGGACCT CGCGCCTTGG TCTCGTCCTC GAATCGGTCG GCGCATGGAC CATTGCAACG
TTGCTTCTGG CAGCCATCTT CTTCACGCAG AGCCGTGGAC CGTGGCTGGG CGGCTTTGCG
TCGCTATTTG TATTCTTTAC CCTGCTCCTG ATTTTGGCGC TGCGTCGGGC GCAGCAGCAG
AATCAGGAGC GCGCGCGCCT CTGGCGCAGG TTGCTGATCG GCGAAGTGGT GGTCGCGCTT
GCGCTGGCAG GCTTTCTGGC AGCCTTCAAT CTTTCCGATG CACCGCTATT CCAGCAGTTG
CGTGACGTTC CGTACATCGG TCGCATGGGG CGTCTGCTCG AAGTTGAGAC CGGCACCGGT
CGAGTGCGCT GGTTGATCTG GTTTGGCGAC GACAAAGCCG GCGGCGCAGC CGCGCTCATT
CGCTCCGATC CGCTTCGCAT GATTGTCGGT TGGGGTCCTG AAACCATGTT TGTGGCGTAC
AACCCCTTTT ATCCTCCGTC GCTGACTAGC CTGGAAGCAC GGACAGCGTC GCCGGATCGG
TCGCATCAGG CGTACCTCGA TGAAATGATC AACAAGGGCC TGCTGGGTCT CATCAGTTAT
CTGTTCCTGT TGTTCAGTTT CTTTGCGCTG GCATGGAGTC TGGCGAATCG TGTGACGGAC
TGGGGTCTCC AGGTGCTGAT TACGGCGGCG ATTGCTGCGG TGACGGCGCA TGTTGTCGAA
GGGTTGACCG GTATTCCCAT TGTATCGACG CTCATGCTTC AATGGGTGTC GTTCGCGCTT
GTAGTGGCGG TTGGCGCCAT CGCAGGCGAG TACCGTATGC CGGGTGCTGC GCCGGTTCAC
CCGGGCGCTG GACCGGCGCC CGCTGCACCT GTTCCGGCAA CGTCGGTGCG GCAGCGCGGC
GGACGACGTG CTCAGGCGGT CAAACGTGGA TCATCGTCAA CCGGTCGATC AACGGTGCCA
TCCCGCGTCC GCTCCGAGGG ACGCGCCGCA GCGATGGGGG TCTATGCCAT CATTCTGATC
CTGGTCTTTT TCGGGGTCTG GACGTCCAAT GTGGACAGCG TGCTGGCGGA CATGCGATTG
CAACAGGCAC AGGGGTATAT CGACAGCCCA GGCGCCCGAC TGGAACAACA GATCGTCGGT
GCAAGTTATC TGCTCGATGC CATTCGCATG GAGCCGGATC AGGATTTCTA CTACCTGACG
CTCGGTCGGT CGTTGATGAC CATGGTCTCA TTGCGCGCCG ACGAAGCGCG CCGCTCTGGC
GCTTCACTTG GGCAGATCGA TCCGAACGCC AGTGTCGCCA CCCTGTTGCA ACTCGATGAC
ACAGCGGAAC TTCAGGCGTT TGTGCTGAGG CGCACGCCGA TGGAACTCCT CAGTTACGCG
CGCGCTGTTC TGCTTCGCGC CCAGCAGATC GCGCCGCTCA ACAAGGACCA TGCGGCCAAC
CTGGCGCGCA TGTACAACTT CTGGTATCAG ACTCTCAGCG GTGGTCAGGA TCGCGACGCA
CTCGCGCAGG CGATCGAGTG GTATCAACGG GCGCATGCGA TTGCGCCGCA AGATGTCGCC
ATTCTGAACG AGTATGCCAG CGCTGTCGCG CGATCCGGCG ACTACGACAA ATCCCTGGCG
CTGCTCGAAA CATCGCGCAA TCTTGATCCG CTCTACAACG ATACCCTGTT GCGGATGGGC
GAAGTTCTGC GGGCGCAAGG TCGCTTTGCA GAAGCCGTCG ATCAGTATCT GACGCTGCTG
GATCGCGATC CGTCTGCACT TAATGGGCAG ATAAGCGCAA TTGCAACGGT GCTTTCGTCC
GACCCGGTAC AATTGGCTCG CCTGCGCGAT CGATACCGTG CAGCACAGGC GTCGCGCCCC
GATGATCCGC AACTGGCGTC GATTGTTGGT CTGCTCTCGG TGCGGTCTGG CGACCTCGAA
ACCGCCGCAA GCGCGTTTGC CGAAGCGGTG CGATTGCAGC CGGACAACCT GGAAGCGCGC
CAGAACTATA CCATTGTGTT GAGCGATCTC CTGCGGTATG ATCAGGCGGC GCAGCAGGCG
CAAGAGTTGC TGACGCTCGC ATCGCAGCGT CAGGAGACGA CCGATCAACA ACGCGCGGCA
ATCGAGGCGT TGCTCGGCTA CCTTCGGCAG CGCGCCAATG GCGGTTGA
 
Protein sequence
MSRSSVSIWC ERVIEGGWLL ALLLIPTYFN LLSARHFEPD KATTLRSIVM VMLAAALIDA 
LERFANRRVT VEDASKPPWW RRIASIPLAV PTSMYAAVFL LATAVSIAPS VSFWGSYQRL
QGTFTNLSYI GLGVLIAFYL RRREQIDRLV TIMILAGLIA AWYGLVQHVQ LDPLPWRGDV
VSRVASTMGN SIFVAAYMIM VLPYALYRGV VALYQARNAD ALPSRPSIDL GWGAAYVLLA
LSALALVFAA MMFGAVVRAA DLRYWWVYPG ALIIAGGLFL LIALAPHRAE RLTFRTLIPG
ILLLAYVVFL GVSYALGQGP NQRVVPLEGR PGVEWPLWLG IAVALMASGY ALVILLPRRA
NGTSRLGLVL ESVGAWTIAT LLLAAIFFTQ SRGPWLGGFA SLFVFFTLLL ILALRRAQQQ
NQERARLWRR LLIGEVVVAL ALAGFLAAFN LSDAPLFQQL RDVPYIGRMG RLLEVETGTG
RVRWLIWFGD DKAGGAAALI RSDPLRMIVG WGPETMFVAY NPFYPPSLTS LEARTASPDR
SHQAYLDEMI NKGLLGLISY LFLLFSFFAL AWSLANRVTD WGLQVLITAA IAAVTAHVVE
GLTGIPIVST LMLQWVSFAL VVAVGAIAGE YRMPGAAPVH PGAGPAPAAP VPATSVRQRG
GRRAQAVKRG SSSTGRSTVP SRVRSEGRAA AMGVYAIILI LVFFGVWTSN VDSVLADMRL
QQAQGYIDSP GARLEQQIVG ASYLLDAIRM EPDQDFYYLT LGRSLMTMVS LRADEARRSG
ASLGQIDPNA SVATLLQLDD TAELQAFVLR RTPMELLSYA RAVLLRAQQI APLNKDHAAN
LARMYNFWYQ TLSGGQDRDA LAQAIEWYQR AHAIAPQDVA ILNEYASAVA RSGDYDKSLA
LLETSRNLDP LYNDTLLRMG EVLRAQGRFA EAVDQYLTLL DRDPSALNGQ ISAIATVLSS
DPVQLARLRD RYRAAQASRP DDPQLASIVG LLSVRSGDLE TAASAFAEAV RLQPDNLEAR
QNYTIVLSDL LRYDQAAQQA QELLTLASQR QETTDQQRAA IEALLGYLRQ RANGG