Gene RoseRS_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0031 
Symbol 
ID5206964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp31829 
End bp33397 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content60% 
IMG OID640593665 
ProductTPR repeat-containing protein 
Protein accessionYP_001274424 
Protein GI148654219 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.918489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCGA CAGAACCGCT CAATTTCATG CCGCTCGATC TGGACACGTT CAGCGGGAGC 
GAGCGATTCA TGGCTGGAAC GCGCCTGGGC GCGGCGTTCG GTCAGGGTAT CCGCGCCTAC
CTGCGCGCTG ACTACGCCAA TGCGATCGAG CATTTCAAAG CCGCTTTGAT CGCTGCATAC
ATCGAAGGTG AAGAACGCGC TCAGATTTAT GATCGCGAAC GGGCGATCAT CTATCTGTAC
ATCGGCAATG CGCTGGCGTA TCAGGAGGAT TGGGAAGGGG CGCTGCGCGA GTATCTCGAA
GCCGTGCAGA CCGATCCGCA ACTGGCTGAG GCGCACTACA ACCTGGGGGT GGCATTTGCG
GCGCAGGGGC GGCTCGACCG CGCGATTGCC GCATTCAAGG AAGCCATCGA ACACAATCCG
CGCCTGTACG AGGCGCACTT CTCACTGGGA CGCTGCTATC AGCGCCTCGA TGATGCCGGT
CGCGCGTACA TTCACTACGA TCAGGCGTGT CAGGCGCGCC CGCAGGCCGC AGAGCCGCGC
TACTACATGG GATTGATGCA CCAGAGCCAC GGCGCGCACG AACTGGCGCA GCGTTGCTTT
GCCGAGGCGC TGCGCGTCGA GCCGACCTTC GTCTCGCCAG AATTGCAGGA CGAGGTGCTG
GTCAACCGCT CGGAAGAGGA AGTCGCCCAA TGGTACTACC GCCTCAGCAA CGATCTGAAA
CAGCAGGGGT ACGAGGAAGA GGCGGAACGG ATCTACCGCG CGCTGCTCCA GTGGCGTCCC
GAAGAACATT ATGCCCGTTA TCTGCTCGGC AATCTGCTGG CGCGCGCGCG GCGTCTCGAT
GAAGCGCTTG AAGCGTATGC CCAGATTCCG CCACAGGACA GATATTACGT CGATGCGCGC
ATTCGGATCA GCGCGATCCT CAAACTTCAG AACAAGATGC GCGAGGCGTA TGATACCCTG
TTCGAGTGCG CCAAACTGCA CCCGACCAAT GGTCAGTTGT TCCTGAATAT GGGTAAGTTG
CTCTACGATA TGAACAAACA CGCTGGCGCT GTCAAAGCGT TTGAGCGCGC CGTGCAACTG
CTCCCCAACG ATCCGCAGGC GCACTACCTG CTGGGGTTTA TGTACAACCT CATGGGACGC
GAGGGATGGG CGCTGGCAGC CTGGCGCAAG GCAGTGGAAC TCGCTCCGGA CGCGCATTCT
CTGCGCTACG ACCTTGGCTA CATGTACGTG CGACGCAACC GCTATGACCT GGCAGCAAAA
GAGTTTGCCC GCGTGCTCCA GTTCTGGCCC GATGATGTCG AGACGAACTT TATGCTCGGA
TTGTGCTACA AAGAACTGAT GGAACCGGCG CGAGCCATTC CGCTGTTTGA AAAAGTGCTG
CGTCGCAATC CGCGCCACGT GCAGGCGCTC TATTATCTCG GCGCTTCGTA CCTTCAGATT
GGCAATACAT CGCTTGGCAA GGCGTATCTC AGGCGCTACG ACTACCTGGC GAGCCAGGAA
CAGTCGAGTG CGCCTGTGAC GCGCCGGACC ATGCGCCAGC GCACTGTCGG CATGCTAGAG
TCGTCGTAA
 
Protein sequence
MSSTEPLNFM PLDLDTFSGS ERFMAGTRLG AAFGQGIRAY LRADYANAIE HFKAALIAAY 
IEGEERAQIY DRERAIIYLY IGNALAYQED WEGALREYLE AVQTDPQLAE AHYNLGVAFA
AQGRLDRAIA AFKEAIEHNP RLYEAHFSLG RCYQRLDDAG RAYIHYDQAC QARPQAAEPR
YYMGLMHQSH GAHELAQRCF AEALRVEPTF VSPELQDEVL VNRSEEEVAQ WYYRLSNDLK
QQGYEEEAER IYRALLQWRP EEHYARYLLG NLLARARRLD EALEAYAQIP PQDRYYVDAR
IRISAILKLQ NKMREAYDTL FECAKLHPTN GQLFLNMGKL LYDMNKHAGA VKAFERAVQL
LPNDPQAHYL LGFMYNLMGR EGWALAAWRK AVELAPDAHS LRYDLGYMYV RRNRYDLAAK
EFARVLQFWP DDVETNFMLG LCYKELMEPA RAIPLFEKVL RRNPRHVQAL YYLGASYLQI
GNTSLGKAYL RRYDYLASQE QSSAPVTRRT MRQRTVGMLE SS