Gene Sare_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2371 
Symbol 
ID5705112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2724741 
End bp2725694 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content67% 
IMG OID641271849 
ProductSec-independent protein translocase, TatC subunit 
Protein accessionYP_001537220 
Protein GI159037967 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0805] Sec-independent protein secretion pathway component TatC 
TIGRFAM ID[TIGR00945] Twin arginine targeting (Tat) protein translocase TatC 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0057855 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCTTTG CCCTGCGTAA ACGCGGCCCG AGCAGCTTCC AGCGGGCCTC GGAAGGCTCG 
ATGACCCTGG TCGAGCACGT CCGCGAGTTG CGCGACCGGC TGTTCCGCGC GTCGCTGGCT
GTCGTCGCCG GCCTGATTGT CGGTTTTGTC CTTGCGCAAC CGGCATTCGA TCTGCTGAAA
GAGCCCTACT GCAACCTCCC GGACTCCACG AACGCGGACG GGGTGTGCCA GGGGTTCCTG
CAGCTGTCCC CAGCGGACGG GTTCCTCCTC AAGCTCAAGC TGGCCCTGTG GATCGGCCTG
ATCGTCGCGG CACCAGTCTG GCTCTATCAG CTCTGGGCGT TCATCGCGCC GGGTCTGCAC
CGGCACGAGC GTAAATGGGC GTACGTCTTC GTCGCCATCG CCGCCCCGCT CTTCGCCGGT
GGCGCCGTCC TCGCCTACCT GGTGGTGGAC AAGGGCCTGG CGTTTCTCAT GGAATCCGGT
GTCACCGGGC TGTCCACGCA ACTCGAGGTG ACCCGCTACA TCTCGTTCGT CACGACCATG
ATCCTGCTCT TTGGGGTGGC GTTCGAGTTT CCCCTGATCC TGCTGATGTT GAACTTCACC
GGGGTGGCCA CCGCGCGGCG GCTGCTCAGC TGGTGGCGCG TGGTGATCTT CGTCTGCTTC
GCCTTCGCCG CCATCGCGAC CCCGGATCCG GGGCCCTTCG GGATGACGTT GCTCGCCCTG
TCGCTGTCGC TGCTGTACTT CGTCGCCGTG GGCGTCGCGT TCCTCAACGA CAGACGTCGG
GGGCGCGGTA AGGAGATCTA CGCAGGCCTC GCCGACGACG AGGTGTCGCC GCTGAAGGAC
GACAACGAGC CGATCGAGGC CAGTGCCCCG GTCGGCGCGC CCGACTCGAT CGCGGAGCCC
GAGCCGGTTG CCAAGCCCGC GCCGATCGAG CGTCGCTACG ACGACATGAC CTGA
 
Protein sequence
MAFALRKRGP SSFQRASEGS MTLVEHVREL RDRLFRASLA VVAGLIVGFV LAQPAFDLLK 
EPYCNLPDST NADGVCQGFL QLSPADGFLL KLKLALWIGL IVAAPVWLYQ LWAFIAPGLH
RHERKWAYVF VAIAAPLFAG GAVLAYLVVD KGLAFLMESG VTGLSTQLEV TRYISFVTTM
ILLFGVAFEF PLILLMLNFT GVATARRLLS WWRVVIFVCF AFAAIATPDP GPFGMTLLAL
SLSLLYFVAV GVAFLNDRRR GRGKEIYAGL ADDEVSPLKD DNEPIEASAP VGAPDSIAEP
EPVAKPAPIE RRYDDMT