Gene Rcas_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3987 
Symbol 
ID5541497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5198892 
End bp5202170 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table11 
GC content59% 
IMG OID640896099 
ProductTPR repeat-containing protein 
Protein accessionYP_001434038 
Protein GI156743909 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.017736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGCA ACCGTGCATT GTTCGACCGC GCGATGGAGC AAAGCCGCGA AGCAGCGCGG 
CTTATGAACT GGGACGAGGC GCTGAAACAG GCAGCCCGCG CCTTGCAAGA ATTCCCTCAG
GACCTGGATG CGCGCTTGGC TGTCGCTGTT GCGTTCTTCA ACACCGCAAA GTATGCGCAG
GCGCTTCAGA TTTTCGATGA ATTGCGTCGC ATCGATGCCG GCAATCCATT CTATCTGGAA
TATCTGGCGC GTACCCACGA GCGTCTCGGC GACGTGCAGG CGGCAACCAA CGCCTATGTG
CAACTTGCCG ATCTTCAGAT GAACCGCAAA CTGGCTGCAC GCGCCATCGA TGCCTTACGC
GAGGCGCTAC GCCTGCAATC CGACGCCGAC GATCAGCGGC TGCGTCTGGC GCAGTTGCTC
GCCGATCAGG GCGCGAGTGC AGAGGCGGCA ACGCACTATC TCGAACTGGC ACGGCGCGCT
CAAGCGCGTG GGAGCCTCGA ACAGGCGGCG GAACTGGCGG AAATGGCGTT GCGCTGCGTA
CCCGACAATC GTGAAGCCAA AGAGTTAATC GCGGCGCTCC ACGATGCCCT GGCACAGACG
GTCCAATCAG CAATCGAAGC GACCGCAACC GCCGCCGATG CAACCGTATT GCCCATTATC
GGCACCGGCG GTTTGCGCGG TGCGCACCTC GCCATCGAAC GAATCATCGC GCTGGCGCAC
GAACGTCAGG AGGCCGGCGA TATTGATGGC GCCATCGAGC AGTATGAACG CGCGCTCAAA
CTTGGAACCG ACCGCAGCGA TGTGTTTTAT AGCCTGGGCT TGCTGTATCA GGAACGCGGC
GACTATCAAC GCGCGGTGGA ACTGTTGCAG GGCGCCGCTG GCGACCCGGA ATACGCCCTT
TCAGCGCATT ATATGCTTGG TCAGGCGTAT CAGGCGCTGG GCCAACTGCC CGAAGCAGCG
CACGAATACG AACAAACTAT TCGTCTGTTG CCGCTGGAAT CGATTGGACG CGCCGAAGCC
GACGATATGA TCCAGATGTA CGAGAGCGCC GCGCAGATCT ATATTCAACT CAACGACATT
GCGCGCGCTG CAACGCTCTA TTCGACGCTG GCAAACTTTC TTCAGGGCAA ACGCTGGGGA
CGCGAGCGCG CCGATGAGTT CCGCCAGAAA GCCAAAGACC TGACCGAGCG CAATATGTTT
GCCAAACTCC GCACGCTCGG CACGGGCGCA CTGACATCAC CGCCATCCGC CCCTGAACCC
GAAACGCCTC CCGAAAGTCC GATGCCCGAA ACCTGGGGGA AAATTCGCCC TATTACCGAT
TTTCTGCGAG CGCCTGAGCC ACAGACCCCC GACCATCATC GCTTTGAACC ATCCCCATCA
GTCGCCGCCG AACCGGTCGA CCCGCTGGCA ATCCTCGAAA CGCTGCCGCA TCCCGAACAC
GTTCCCGTTG CACCGGTAAC ACCGCTCGAC ACGACCGGAC TGGATGAAAT CGGTGAACGG
TATGTGCTGG CGAGCGAAAA GTATGTCGAA CAGGGATTGA TGCTGGCAGC CAACGACGCA
TGTATGGAGG TTATCAGGCT GAACCCGGAG TATCTGCCGA TTCACCTGCG ATTGGGAGAG
ATTTACGAGC GCGATGGACG CAAAGACGAA GCATTAATCA AGTATCAACT GTTGATCGAC
ACGTATGTGG CGCGGGGAGA ACCCCGGCGC GCCATCGACG TTTACTACCG GCTGATCGAA
CTGTCGCCCG ATACAATTAT GCCGCGTTCG CGGCTGGCTG AGTTGCTGCG CGCTGACGGA
CGAAACGAAG AGGCAGCGCA GCAACTTGCC ATTGTTGCCG GCGCCTATTT TCGGATGGGA
CAAACGACCA AAGCGCTCGA GGAGTACCGA CGCGCCCTGC AATGGTCGCC GTCCAATGCA
GAACTCCACG CACAGTATGG GCAGGCGCTG CTGAAACTCG ACCGCGCCGA GGCAGCGCTG
GTCGCTTTCC GGCGCGCTCT CGAACTCGAT CAGCAGAATC CGGTTCATAT TGCACGCATC
AACATGGCGC TGGCCATTAT GGGTGAGCAA CCCGTCGCCG TCTGGCAATC GCTGGCGACG
CTGCTCGACC AGCTCAAACA GCATCCCCAA CGCCTGAATG AAGTACAATC CGAGTACCGC
GCCACGTTAC TGGTTGCCGA TCTTCCCATA TTGCACTACA TTCTGGGCAT TGTTCAGCAG
CACGCCGGGC AACATCCATC GGCGCTGCTG GAGTTCGAGC AGGCCATCGA ACTGCTGAAC
GCCGAGAACG ATCCGACTCT GACGCCACAC CTGGTTTATC AGGCAATGGC GGATAGTCAC
ATTGCACTGG GTCAGGCGAG TGAGGCGTTG CGCCAGCTTC AGCATGCCCT GGATCTTGCT
CCTGCGCCGC CGCCGCCGGA AAACGCCCGT TATCCGTTTT CGCTCCCCCT TTCTCAGGGC
GAAATCGTGC GCCGTATGGC GGAAGCATAT GCCGCTGTCG GCGATCTCGC CAGCGCCGAA
CGCGCGCTTC AGGAAGCCAA ACAGTTTCTG CCCTACGACC GCGCGATCTA CACCAAACTG
TCCGATATCT ATTTTCGGCA GGGTCGCCTG AATGAAGCGC TGACGCAACT CGACGAACTG
GCGACCCACT ATGAGCAGCG CCAGATGCTG GATCGCGCGA TCGAGGCACT CGAAAATGCG
CTGCGCCTGG CGCCCAACAA TATTCCGATC AGCCATCGCC TGGCGAAAAT GTATATCCGG
CGCGGCTACC TGGATAAAGG CGTCGAAGCG CTGACGCGTG TCGCAGAACT TCAACGGAAG
GAAGGGCAGA TCAAAGACGC CATTGCCAAC CTTCAGCAGG CGGCCGAGGT GCATTGGACG
CTTGGCAGGC AGGAGGAGGC GCGCGCACTC TATGACAAAA TCGTGCATAT CGCACCCAAT
GATATTGAAG CACGCCAGTG GCTGTCGTTT ATGTACACCC TGGCCGGCAT GACGCGTGAA
GCCGTTGCGC AAAAGAAACA GATTATTCGC ATCCTGCTCC AGCGGCGCGA TCTGGATAAC
GCTATTGCGG AGATGCATCA AATCTACGGA CTCGACCAGA ACGATACCGA TAATCTGTTT
CAACTCGGTG ATGCGCTTAT GCGGCGGCAG GAATATGAAC AGGCAATCCG CATCTATACC
AGGCTGGCAA AAGTGCCAGG CGTCGAAATT GAACGCGTCG AAGCGTTACA GGCAGCAGCC
AGACGGATGC TCGAACAACA ACAGGCAGGA AACCGGTGA
 
Protein sequence
MPGNRALFDR AMEQSREAAR LMNWDEALKQ AARALQEFPQ DLDARLAVAV AFFNTAKYAQ 
ALQIFDELRR IDAGNPFYLE YLARTHERLG DVQAATNAYV QLADLQMNRK LAARAIDALR
EALRLQSDAD DQRLRLAQLL ADQGASAEAA THYLELARRA QARGSLEQAA ELAEMALRCV
PDNREAKELI AALHDALAQT VQSAIEATAT AADATVLPII GTGGLRGAHL AIERIIALAH
ERQEAGDIDG AIEQYERALK LGTDRSDVFY SLGLLYQERG DYQRAVELLQ GAAGDPEYAL
SAHYMLGQAY QALGQLPEAA HEYEQTIRLL PLESIGRAEA DDMIQMYESA AQIYIQLNDI
ARAATLYSTL ANFLQGKRWG RERADEFRQK AKDLTERNMF AKLRTLGTGA LTSPPSAPEP
ETPPESPMPE TWGKIRPITD FLRAPEPQTP DHHRFEPSPS VAAEPVDPLA ILETLPHPEH
VPVAPVTPLD TTGLDEIGER YVLASEKYVE QGLMLAANDA CMEVIRLNPE YLPIHLRLGE
IYERDGRKDE ALIKYQLLID TYVARGEPRR AIDVYYRLIE LSPDTIMPRS RLAELLRADG
RNEEAAQQLA IVAGAYFRMG QTTKALEEYR RALQWSPSNA ELHAQYGQAL LKLDRAEAAL
VAFRRALELD QQNPVHIARI NMALAIMGEQ PVAVWQSLAT LLDQLKQHPQ RLNEVQSEYR
ATLLVADLPI LHYILGIVQQ HAGQHPSALL EFEQAIELLN AENDPTLTPH LVYQAMADSH
IALGQASEAL RQLQHALDLA PAPPPPENAR YPFSLPLSQG EIVRRMAEAY AAVGDLASAE
RALQEAKQFL PYDRAIYTKL SDIYFRQGRL NEALTQLDEL ATHYEQRQML DRAIEALENA
LRLAPNNIPI SHRLAKMYIR RGYLDKGVEA LTRVAELQRK EGQIKDAIAN LQQAAEVHWT
LGRQEEARAL YDKIVHIAPN DIEARQWLSF MYTLAGMTRE AVAQKKQIIR ILLQRRDLDN
AIAEMHQIYG LDQNDTDNLF QLGDALMRRQ EYEQAIRIYT RLAKVPGVEI ERVEALQAAA
RRMLEQQQAG NR