Gene Rcas_4211 details


Gene Information

Locus tag: Rcas_4211
Symbol:
ID: 5541722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5446724
End bp: 5448292
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 60%
IMG OID: 640896318
Product: TPR repeat-containing protein
Protein accession: YP_001434256
Protein GI: 156744127
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
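
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; both are assumptions, and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5446724, 5448292)
print(length)                  # 1569
print(protein_length(length))  # 522
```

Both results agree with the listed Gene Length (1569 bp) and Protein Length (522 aa).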


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.185796
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCGA CAGAACCGCT CAATTTCATG CCGCTCGATC TGGACACGTT CAACGGAAGC 
GAGCGATTCA TGGCGGGGAC GCGCCTGGGG GCGGCGTTTG GACAGGGCAT TCGCGCCTAC
CTGCGCGCTG ATTACGCAAA TGCGATCGAG CACTTCAAGG CTGCGTTGAT CGCCGCCTAT
ATCGAGGGAG AAGAACGTGC TCAGATCTAT GATCGCGAAC GTGCGATCAT CTATCTCTAC
ATCGGTAATG CACTGGCGTA CCAGGAGGAT TGGGAAGGAG CGCTGCGCGA GTATCTGGAA
GCGGTGCAGA CCGATCCGCA ACTGTCGGAG GCGCACTACA ACCTGGGCGT GGCATTTGCC
GCGCAGGGGC GTCTTGATCG TGCGATTGCC GCGTTCAAGG AAGCCATCGA GCATAATCCG
CGCCTGTACG AAGCGCACTT CTCGCTCGGA CGCTGCTATC AGCGTCTCGA CGACGCCGGG
CGAGCGTATA TTCACTACGA CCAGGCATGT CAGGCGCGTC CTCAGGCGGC CGAGCCGCGC
TACTACATGG GGTTGATGCA CCAGAGCCAC GGCGCGCACG AACTGGCGCA GCGCTGTTTC
GCCGAAGCGT TGCGCGTCGA GCCAACCTTC GTCTCGCCGG AGTTGCAGGA CGAAGTGCTG
GTCAATCGCT CGGAAGAAGA AGTCGCTCAG TGGTACTACC GCCTCAGCAA CGATCTGAAG
CAGCAGGGGT ACGAAGAGGA GGCGGAGCGG ATCTACCGGG CATTGCTCCA GTGGCGGCCA
GAAGAACACT ATGCGCGCTA TTTGCTCGGC AACCTGCTGG CGCGCGCGCG GCGGCTCGAT
GAGGCGCTCG AAGAGTATGC TCAGATCCCG CCACAGGATA AATATTATGT CGATGCGCGT
ATTCGTATCA GTGCTATTCT CAAGTTGCAG AACAAAACAC GCGAGGCGTA TGACACCCTC
TTCGAGTGCG CCAGGCTGCA CCCTGCCAAC GGTCAGTTGT TTCTGAACAT GGGCAAGCTC
CTCTACGATA TGAACAAACA TGCGGGCGCT ATCAAGGCAT TCGAGCGCGC GGTGCAGTTG
CTCCCCAACG ATCCGCAGGC GCACTATCTG TTGGGGTTTA TGTACAACCT GATGGGTCGC
GAGGGGTGGG CGCTGGCAGC CTGGCGCAAG GCGGTCGAAC TTGCGCCGGA CGCGCATTCG
CTGCGCTACG ACCTCGGCTA TATGTATGTG CGACGCAATC GCTACGATCT GGCGGCGAAA
GAGTTTGCGC GCGTGCTCCA ATTCTGGCCC GACGACGTCG AAACGAACTT TATGCTTGGG
TTGTGCTACA AAGAATTGAT GGAACCGGCG CGCGCCATTC CGCTGTTCGA GAAAGTGCTG
CGACGCAATC CGCGCCACGT GCAGGCGCTC TACTATCTGG GCGCATCGTA CTTGCAGATC
GGCAACACCT CGCTGGGGAA AGCCTATCTC AGACGCTACG ACTACCTGGC GAGCCAGGAG
CAGACAAGCC CGCCCACGAC GCGTCGCGCG ATGCGGCAAC GCAGCGTCGG GATGGTCGGT
TCATCGTGA
 
Protein sequence
MSSTEPLNFM PLDLDTFNGS ERFMAGTRLG AAFGQGIRAY LRADYANAIE HFKAALIAAY 
IEGEERAQIY DRERAIIYLY IGNALAYQED WEGALREYLE AVQTDPQLSE AHYNLGVAFA
AQGRLDRAIA AFKEAIEHNP RLYEAHFSLG RCYQRLDDAG RAYIHYDQAC QARPQAAEPR
YYMGLMHQSH GAHELAQRCF AEALRVEPTF VSPELQDEVL VNRSEEEVAQ WYYRLSNDLK
QQGYEEEAER IYRALLQWRP EEHYARYLLG NLLARARRLD EALEEYAQIP PQDKYYVDAR
IRISAILKLQ NKTREAYDTL FECARLHPAN GQLFLNMGKL LYDMNKHAGA IKAFERAVQL
LPNDPQAHYL LGFMYNLMGR EGWALAAWRK AVELAPDAHS LRYDLGYMYV RRNRYDLAAK
EFARVLQFWP DDVETNFMLG LCYKELMEPA RAIPLFEKVL RRNPRHVQAL YYLGASYLQI
GNTSLGKAYL RRYDYLASQE QTSPPTTRRA MRQRSVGMVG SS
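
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table from the amino-acid string used by NCBI translation table 11 (whose codon-to-amino-acid assignments are the same as the standard code), and the `translate` helper is a hypothetical name. It does not handle alternative start codons, which is harmless here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAGTTCGA CAGAACCGCT CAATTTCATG"))  # MSSTEPLNFM
print(translate("TCATCGTGA"))                         # SS
```

The two outputs match the first ten and last two residues of the listed protein sequence, with the final TGA read as the stop codon.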