Gene Rcas_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1709 
Symbol 
ID5539187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2204275 
End bp2206113 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content60% 
IMG OID640893848 
Producttype II secretion system protein E 
Protein accessionYP_001431819 
Protein GI156741690 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.708593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATAA CCCGCGTCAC TCCGAAAGCT GGCTCTACCT CGCCTCGATC CGCCTCCTCG 
GCCGCCGTCT CGCCAAAGCG TCGTGAGCCA GGGCACTCTT ATCATTGGAC TATTGGAGAG
CTTCTTATTC CGGCATGCGA ACGCATCGTC ACGATCGAGG ATGCAGCCGA ACTGCGGTTT
CATCGCACCC ATCCACACGT GGCGCGCCTT GAGGCGCGTC CACCGAATGT CGAAGGCGCT
GGCGAAGTCA CGATTCGCCA ACTGGTGCGT AACGCGCTGC GTATGCGCCC GGATCGGATC
ATCGTCGGCG AGGTGCGCGG CGCTGAGGCG CTCGACATGC TCCAGGCGAT GAACACCGGC
CACGAAGGCT CGATGACGAC GGTGCACGCC AACTCGCCGC GTGATGCGTT CAGCCGCCTG
GAGACGATGG TTATGTGGGC TGAGGGCGCC AGGGAGCTGC CGCTCAGCGC CATTCGGGAA
CAACTCGTCG GCGCGCTCGA TATTGTCATT CAACAGACGC GCCTGCCCAA CGGTCGTCGC
AAGGTTATCA GCATTTCCGA GGTTCAGGGC ATACGCCACG GTGAAGTCGA ACTGCGCGAC
ATTTTTGTGT TTCACAGCAG TGTTGATGAC AAAGGTCAGG TGCTTGGCGA GTTTATGGCA
ACCGGCGCCC TGCCGCGGTG CCTGCGCAAG ATTGAACCGG CGTGCGGCGC ACTCGATCAC
CTGTTCAAGT CGAATTATCT GCGTGATACG CTTGGACCGG AGATTCTGCA GAACCCGGCG
ATTACCGAGA TTATGGTCAA TGGTCCATTT GATGTCTGGA TCGAAGAGCG CGGCAAACTG
CGTCCGGCGC CGGAGATCCG CTTCCGCGAC CATCACCATC TGCTGAATGT CATCAACACG
ATCATCGCAC CGCTGAACCG TCGTCTTGAT GAGTTGAATC CGATGGTCGA CGCGCGTCTG
CCGGATGATG AGCGCTTTCC CGGCGGTGGT CGCATCAACG CCGTTCTCGA CCCGATTTCT
CTGGTTGGTC CGGTGTTGAC CATCCGGCGC TTCAGCCATA CCCCATTTAC GCTCGATCGA
TTGGTGGCGC TTGGCAGTAT GTCGCCACAG ATGGCGGCAT TTCTGCGCGC GTGTGTTCGC
ATCAGACGCA ACATGCTGAT CTCTGGCGGC GCCGGCAGTG GGAAAACCAC GTTGCTGGGC
GCGATTGCAA AGGAGATCGA TCTGCAGCGT GAGCGCATTA TCACCATCGA AGACGCCGCA
GAACTGCGCA TTGGCGCGCC AGGGGACCAC GTGCTCGGTC TCGAGACCCG TCCGCCCGAC
CGATTTGGCG AAGGTGAGGT GACGATCCGC CAGCTGGTGC GCAATGCACT GCGTATGCGC
CCCGACCGGA TTATTGTCGG CGAGGTGCGC GGCGCTGAGG CGCTCGATAT GCTGCAAGCC
ATGAATACCG GGCACGAAGG CTCGCTCACT ACGCTGCACG CCAACTCGCC TCAGGAAGCG
TTCAGTCGAT TGGAGACGAT GGTTCTCTGG GCGCGCGAAG CCGAGGCGCT CTCGCTCTCC
GCCATCCGAC GTCAGTTGTG CACGCTCGAT ATTGTGGTTC AGCAGGCGCG CCTTGCCGAC
GGCAGCCGCA AAGTCGTCGC TATTGCCGAA GTGGTTGGTC TTGATGAGCG TGACCAGGTG
CACGTCGAGG AGATTTTTCG GTTCGAGCAG CATGGCATCG ATCCTGACGG CCAGGTGGTG
GGTGAGCATG TTGCAACCGG CTACGTGCCG CGGGTGCTGG AAAAGTTGTG TGCATATGGC
ATCACGCTGG AGGAAAAGGC ATGGTGCAGC CGATCATGA
 
Protein sequence
MTITRVTPKA GSTSPRSASS AAVSPKRREP GHSYHWTIGE LLIPACERIV TIEDAAELRF 
HRTHPHVARL EARPPNVEGA GEVTIRQLVR NALRMRPDRI IVGEVRGAEA LDMLQAMNTG
HEGSMTTVHA NSPRDAFSRL ETMVMWAEGA RELPLSAIRE QLVGALDIVI QQTRLPNGRR
KVISISEVQG IRHGEVELRD IFVFHSSVDD KGQVLGEFMA TGALPRCLRK IEPACGALDH
LFKSNYLRDT LGPEILQNPA ITEIMVNGPF DVWIEERGKL RPAPEIRFRD HHHLLNVINT
IIAPLNRRLD ELNPMVDARL PDDERFPGGG RINAVLDPIS LVGPVLTIRR FSHTPFTLDR
LVALGSMSPQ MAAFLRACVR IRRNMLISGG AGSGKTTLLG AIAKEIDLQR ERIITIEDAA
ELRIGAPGDH VLGLETRPPD RFGEGEVTIR QLVRNALRMR PDRIIVGEVR GAEALDMLQA
MNTGHEGSLT TLHANSPQEA FSRLETMVLW AREAEALSLS AIRRQLCTLD IVVQQARLAD
GSRKVVAIAE VVGLDERDQV HVEEIFRFEQ HGIDPDGQVV GEHVATGYVP RVLEKLCAYG
ITLEEKAWCS RS