Gene Rcas_4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4006 
Symbol 
ID5541516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5214797 
End bp5216149 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content57% 
IMG OID640896118 
Productpreprotein translocase, SecY subunit 
Protein accessionYP_001434057 
Protein GI156743928 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000605815 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000026914 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTGAAT CGGTTGTCAA CGCGCTCCGA TTGCCCGATC TGCGGCAGCG CATCCTGTTC 
ACGCTGGCAA TGCTGCTGCT CTTTCGGCTG ATTGCTCACA TTCCTGTGCC GAACATCGAT
CCGATGGCGC TGGAAAGCCT GCGGCTCGCT TTGCAGGGCA ATCAGCTCGC GCAGTTACTC
AACATTTTCG CCGGCGGCGC ATTGCAGAAC CTGTCGGTTG CGGCGATGGG CGTCTATCCG
TACATCACAG CGCAGATCAT TCTGCAACTC CTGGTACCGC TCATTCCGGC GCTGGAAGAA
TTGCGGAAAG AGGGCGAACA GGGGCGCATG CGGTTGAACC GGATCACGTT CTACCTGACG
ATCCCGATGG CGCTGTTGCA AGCCTATGGG CAAACGTTGA CGCTTGAACG CAGCCTCAGA
ACCGGTCAGG CGCTGTTTCA AACGCCGTTC GATATTGTCA ATAATTTCTT CCCGACGTTC
ACAATCCTGA TGAGTATGCT GGCAGGTACG ATGCTGCTGG TGTGGCTTGG CGAGCAGATC
CAGGAACGCG GGATCGGCAA CGGCGTCTCG ATGATTATTT TCGCAGGCAT TGTAGCCGGT
TTGCCGGGGT TGATCATTCA GGCATTCACC ACCGTCGAAC TGGGCGGGCT GGAACAGGCC
ATCGGCTTGA TCGCCTTCCT GATCATCGCA CTTGGAACGA TCATCGGCAT CGTGCTCATG
CACGAGGGGC AGCGGCGCAT CCCGGTGCAA TACGCCAAGC GGGTGCGCGG CAATCGCGTG
TATGGCGGGC AGAGCAGCCA TATTCCGCTC AAGGTCAATA TGGCAGGCAT GATCCCGCTG
ATCTTCGCGC AGAGCATCAT TATTTTTCCG GGCACAATCG CCTCCTACGG CTGCCCTGAA
CAAGTGGCGC CTCCCAACGC AAGCGTGCTG AAGCAGATCG CCTGCTTCAC CTATCAGACG
TTCAGCCCGC AGTATGGCGG CGGCACCCTG GTGTACTCGA TCGCGCTCTT CGTGCTGGTT
TATTTCTTCA CCTACTTCTA TACGAAGGTG ATTTTCGACC AGCAGAATAT CCCGGAGACG
CTGCAACGCA ACGGCGGATT CATTCCGGGT ATTCGGCCCG GGAAACGCAC TGAAGAGTAT
CTTGACCAGG TGGTGAGCCG AATCACACGA ATTGGCGCAC TCTTCCTGGG AACGGTCGCT
ATCTTGCCGT TCATCACCCA ACAACTCACC GGTGTACCGA TTGGTCTGGG AGCGACTGCA
TTGCTCATTG TCGTCGGCGT CGCGGTCGAT ACAATGCGGC AACTCGAAGC GCAACTGGTG
ATGCGGAACT ACGAAGGGTT TATCAACCGT TGA
 
Protein sequence
MLESVVNALR LPDLRQRILF TLAMLLLFRL IAHIPVPNID PMALESLRLA LQGNQLAQLL 
NIFAGGALQN LSVAAMGVYP YITAQIILQL LVPLIPALEE LRKEGEQGRM RLNRITFYLT
IPMALLQAYG QTLTLERSLR TGQALFQTPF DIVNNFFPTF TILMSMLAGT MLLVWLGEQI
QERGIGNGVS MIIFAGIVAG LPGLIIQAFT TVELGGLEQA IGLIAFLIIA LGTIIGIVLM
HEGQRRIPVQ YAKRVRGNRV YGGQSSHIPL KVNMAGMIPL IFAQSIIIFP GTIASYGCPE
QVAPPNASVL KQIACFTYQT FSPQYGGGTL VYSIALFVLV YFFTYFYTKV IFDQQNIPET
LQRNGGFIPG IRPGKRTEEY LDQVVSRITR IGALFLGTVA ILPFITQQLT GVPIGLGATA
LLIVVGVAVD TMRQLEAQLV MRNYEGFINR