Gene RoseRS_3839 details


Gene Information

Locus tag: RoseRS_3839
Symbol:
ID: 5210821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4798103
End bp: 4799083
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 65%
IMG OID: 640597435
Product: TPR repeat-containing protein
Protein accession: YP_001278143
Protein GI: 148657938
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
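
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates (which the 981 bp value implies) and an unspliced CDS ending in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4798103, 4799083)
print(length_bp, protein_length(length_bp))  # prints: 981 326
```

Both results agree with the Gene Length and Protein Length fields of this record.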


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000120659
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAACGGTC GTGTGATCAG GCGGGTGTCG GTAGCGAAGC GGGTCGTGGC ATGCGTGGCG 
CTCGCCATAC TCCTCACGGC ATGCGCCGAA CCGCTGGTCG AACCGCGTCC GCCGCGCACG
CCGACAGCGG TGATCGACCG GTACAACGCC ATCGTTGCGA CTGCTGAGGC GGGCGATGAT
CTGTTGGCGC GCGCCACAGC CTACTATGAG CGCGGCAACG TCTGGTTCGA TCAGCAGAAC
TACACCGCCG CCATTGCCGA CTACGATCGG GCGCTCGCGC TCGACCCCTC TCTGTCGCGC
GCTGTTCACA ATCGTGGTCT GGCGTATGCA CTGTTGGGGG ACGACAACGC CGCGCTGCGG
GATTATGCCG AGGCTATTCG TCTCGATCCC GCCTACCGGC GCGCCTACGA GAATCGGGTG
CGCCTGCTGG AACAACTTGT CGTTGAGAGG ACAGACGAGA CGCTGCTGCA ACAACTGGCG
GACGACTACG GCAAACTCGC GGAACTGATA CCTGAAGCAG AAGCGATGTA CCGGTATCGA
CAGGGGGTAA TCCTGGCGCG GCTGGGTGAC CGATTGGCGG CGCGTGAAGC GTTCGACGCT
GCACTCCGCG CCCGACCGCA GCACGTCGAT GCGCTCTACG AACGCGCACT GTTGCACTAC
GCCGACGGCG ACCTGAATGC CGCCCTCGCT GACCTCGACG CCGCGCTGCG TCTGAGTCCC
CGCGCTGCCA ACGCCTACTA TGTGCGCGGA TTGATCCGCC ACGCTCAGGG AGACATACGC
AGCGCAATCG CCGATTTCGG TCAGGCGCTG GTGCTCCAGC CCGAATACCC CGAAGCGCTC
ATTGCACGGG CAGCGGCATA CGCGGCGCAG GGCAACACCG CTGCCGCCCG CGCCGACCTC
CAGCGCCTCG ATGGATTGAC GCTTGACCCG ACGCTTCAGC AGGCACGCGA GGCGCTGCGG
GTGCAGACAG ATTCGCCCTG A
 
Protein sequence
MNGRVIRRVS VAKRVVACVA LAILLTACAE PLVEPRPPRT PTAVIDRYNA IVATAEAGDD 
LLARATAYYE RGNVWFDQQN YTAAIADYDR ALALDPSLSR AVHNRGLAYA LLGDDNAALR
DYAEAIRLDP AYRRAYENRV RLLEQLVVER TDETLLQQLA DDYGKLAELI PEAEAMYRYR
QGVILARLGD RLAAREAFDA ALRARPQHVD ALYERALLHY ADGDLNAALA DLDAALRLSP
RAANAYYVRG LIRHAQGDIR SAIADFGQAL VLQPEYPEAL IARAAAYAAQ GNTAAARADL
QRLDGLTLDP TLQQAREALR VQTDSP
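
The protein sequence above follows from the gene sequence under translation table 11, in which the GTG at the start of this gene is read as methionine. The sketch below is a hedged illustration, not the pipeline that produced this record: it builds the codon table in the usual TCAG order, treats the table 11 initiation codons as Met at the first position only, and also computes GC percentage. The names `translate`, `gc_percent`, and `clean` are hypothetical helpers, and input is assumed to contain only A, C, G, and T plus whitespace.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons allowed by table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("GTGAACGGTC GTGTGATCAG GCGGGTGTCG"))  # prints: MNGRVIRRVS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.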