Gene Rcas_4155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4155 
Symbol 
ID5541666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5377329 
End bp5378744 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content60% 
IMG OID640896266 
Producttype II secretion system protein E 
Protein accessionYP_001434204 
Protein GI156744075 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.55327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCGAT TTCGTTTTTC GCGCAGCGGC GCGAACAATG CACAGCAATC AGCCAATGCC 
TCACCTTTCG ACGATCTGCG CTCACCGGTC GAATCGGTAC GCCCTGTTGC GCCACGATCG
CCGGTTCCGC CATCGCTCCA CGACGACACC CGTGACGAAC GCGATCTGAT CGAAGCAGTG
CAATCGCAGT TGATCAACGA AACCGACACC AGCGGTCGGC GCGAGCCGGA ATACTACGCT
CGCCGCATTG CCGAACTGGT GACCGAGCAC CTCGAACAAA GCGGACGCGT CGTTTCTGAG
CGTGAGCGCA ACCGCCTGAT TCGCCTGGCA CAGTCCGAAC TGCTTGGGCT TGGTCCGCTC
GAACCGCTGC TCGCCGACGA TACCGTCAGC GAGATTATGG TCAACGGTCC GCATCAGATT
TGGATCGAGC GCAACGGCAA GTTGCAGGAG ACCGACGCCC GCTTCATCGA CGAAGATCAC
GTGCGACGCA TTATCGACCG GATCATCTCG CCACTCGGTC GGCGCTGCGA CGAAACGACG
CCAATGGTTG ACGCGCGCCT GCCCGACGGC TCGCGCGTCA ACGCGATCAT TCCACCGCTG
GCGATCAACG GCAGCACGAT CACCATCCGC AAGTTCTCGC GTATTCCGTT GACTGCCACT
GACCTGATCA CACGTGGGAC AGCATCGCCA GAACTGATGG AACTGCTGCG GGCATGCGTG
CTCGGACGGC TCAACTGCAT CGTTGCCGGC GGCACCGGAA CCGGCAAGAC TACAATGCTC
AATGTGCTTT CGTCGTTCAT CCCCGACGAT GAGCGCATCA TCACCATCGA GAACGCCGCC
GAACTCCAAC TTCAACAGCG CCATGTCGTC ACCCTCGAGT CGCGCCCCGC CAATATCGAA
GGGCGCGGCG AAGTAACCAT GCGCGATCTG GTCGTTAATG CGCTGCGCAT GCGCCCGGAT
CGGATCGTCG TTGGGGAGTG TCGCGCGGGT GAAGCACTCG ACATGCTCCA GGCGATGAAT
ACCGGGCATG ATGGTTCGAT GACCACGTTG CATGCCAACA GCCCGCGCGA CGCACTGCGC
CGCATGGAAA CGATGGTTAT GATGGCGGGC ATGGACCTGC CGCTGCGCGC CATCCGCGAA
CAGATTGCAT CGGCGATCCA CGTTATTATT CAGCTTGAAC GTCTCCAGGA CGGTTCGCGC
AAGATTGTGC AGGTCTGCGA AGTCACCGGG ATGGAGAACG ATGTTGTCTC CCTCTCAGAT
CTGTTCGTCT TCCAGCAGCA GGGGGTGCGC GACGGCAAAG TCGTTGGACG GATCGTGCCG
ACGGGCATTC GACCACGGTT CCTCGAAAAG TTGCAACAGA TGAATATCAC GTTGTCACCG
CAGGTGTTCG GCGCCGTTAT TCCAGGCGTT CGCTAG
 
Protein sequence
MNRFRFSRSG ANNAQQSANA SPFDDLRSPV ESVRPVAPRS PVPPSLHDDT RDERDLIEAV 
QSQLINETDT SGRREPEYYA RRIAELVTEH LEQSGRVVSE RERNRLIRLA QSELLGLGPL
EPLLADDTVS EIMVNGPHQI WIERNGKLQE TDARFIDEDH VRRIIDRIIS PLGRRCDETT
PMVDARLPDG SRVNAIIPPL AINGSTITIR KFSRIPLTAT DLITRGTASP ELMELLRACV
LGRLNCIVAG GTGTGKTTML NVLSSFIPDD ERIITIENAA ELQLQQRHVV TLESRPANIE
GRGEVTMRDL VVNALRMRPD RIVVGECRAG EALDMLQAMN TGHDGSMTTL HANSPRDALR
RMETMVMMAG MDLPLRAIRE QIASAIHVII QLERLQDGSR KIVQVCEVTG MENDVVSLSD
LFVFQQQGVR DGKVVGRIVP TGIRPRFLEK LQQMNITLSP QVFGAVIPGV R