Gene Rcas_4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4156 
Symbol 
ID5541667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5378759 
End bp5379730 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content59% 
IMG OID640896267 
Producttype II secretion system protein 
Protein accessionYP_001434205 
Protein GI156744076 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4965] Flp pilus assembly protein TadB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.625918 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.530435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCGC CTGTCTCGCT CGATCTTGTA CTTCCGATTG GCGCCGCGCT GCTGGCAGGA 
CTGGCAGTTC TGACGCTGAT ATTCGCACTC CACAGCCTGG CAGCGCGCGA CACCGGCGTC
GATAGCCGGA TCGCCGCCTA CCTGGGAGGC GGTCGTGCGG AGCGCAGTAG CACACTTAAC
GATCAACAGA TCGCCGAACG ACTGAACGAG GTCATCAAAC GGCAGAGTTT CGCTTCCCGG
ATCGAACACG ACCTTGCCGC CGCCAGTCTG CCGCTCACCG TGCCGGAATA TCTGCTGATG
CGCATTGCTG TGCCGCTGAT CGCCACATTG CTGGCGTTGC TGATCTGGCG CCAGGCGTTG
ATTGCACCGG CAGCACTGAT TATCGGTCAC CTGGCGCTCT CTTTCTGGAT GCGCATCCGC
CGTCAGCGTC GCAAGCAGGC GTTCAGCGAT CAGTTGCCCG AAACGCTTGA TTTGATCACC
GCATCGATGC GCGGCGGGTT CAGCCTGGTG CAGTCGCTTG CCAATGTTGC CGGTGATGTG
CAGGAACCGA TGCGCACCGA ACTGCGGCGC GTGTTCCAGG AAGTGCAACT CGGCTTGAGC
ATCACGCAGG CGCTCGACAA TCTGGTGCAG CGCATGGAGA GCACCGACCT CGACCTGGTA
GTGACGGCGA TCAAAATTCA CGCCCGCGTT GGCGGCAATC TGGGGCAGAT TCTCGAGAAC
ATCAGCACCA CTATGCGCGA ACGCGCCAAA CTACGGCGCG AGGTGCGCGT CATCACCTCA
ATGCAGCGCA TCTCCAGTTA TGTTATTGGC GCCTTGCCTT TCGCGCTGGC GTTGATCATT
TTCACGATTA ATCCGACGTA TATGATGCGG CTGTTTCAGC CGGGGTTGAT CCTTTGCATC
CCGATTGGTG CGTTCGTTTC ATCGGTCGCC GGTTTTCTCA TCATTCGGAA GATCGTTGAT
GTGAGGATCT GA
 
Protein sequence
MDSPVSLDLV LPIGAALLAG LAVLTLIFAL HSLAARDTGV DSRIAAYLGG GRAERSSTLN 
DQQIAERLNE VIKRQSFASR IEHDLAAASL PLTVPEYLLM RIAVPLIATL LALLIWRQAL
IAPAALIIGH LALSFWMRIR RQRRKQAFSD QLPETLDLIT ASMRGGFSLV QSLANVAGDV
QEPMRTELRR VFQEVQLGLS ITQALDNLVQ RMESTDLDLV VTAIKIHARV GGNLGQILEN
ISTTMRERAK LRREVRVITS MQRISSYVIG ALPFALALII FTINPTYMMR LFQPGLILCI
PIGAFVSSVA GFLIIRKIVD VRI