Gene Phep_3820 details

Gene Information

Locus tag: Phep_3820
Symbol:
ID: 8254954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4583951
End bp: 4584844
Gene Length: 894 bp
Protein Length: 297 aa
Translation table: 11
GC content: 40%
IMG OID: 644937484
Product: Sec-independent protein translocase, TatC subunit
Protein accession: YP_003094073
Protein GI: 255533701
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0805] Sec-independent protein secretion pathway component TatC
TIGRFAM ID: [TIGR00945] Twin arginine targeting (Tat) protein translocase TatC
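
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, since the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 4583951
end_bp = 4584844

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One of the codons is the stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 894 297
```

Under these assumptions the result agrees with the listed Gene Length (894 bp) and Protein Length (297 aa).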


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00172839
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATA ATAAAAAACG TGACCTGATT GGGGCCATCA AAGAAAAAGG AAAGACTTTA 
GAGGCAGAAA TGTCTTTTTT TGACCATATA GATGTACTTA GAAAACATTT ACTACGCTCA
TTACTGGTCG TAGTATTATT TACCATAGGC GCATTCTGGT TTTCAGACTT TATATTTAAT
GATCTGATCA TGGGGCCTAA GAACCCTGAT TTCTGGACTT ACAGAATGAT GTGCAAGATG
GCTGCTGCAT GGCCGAACCT GATCGGTTCA GACTTTTGCA TCACGCATAT TGATGCCAAG
ATCATCAATA CAGAAATGGC CGGACAGTTT ACCCTGCAGA TCAATGCCTG CGTAATGGTC
GGCATTATAC TCGGCATCCC ATATATTTTA TTTGAACTAT GGCTGTTCAT TAAACCTGCC
TTACACGACA ATGAGCGCAA ATCGGCCAGT CATTTTGTGA TGTTTGCCTC TACACTTTTC
TTCATAGGGA TCCTGTTCGG TTATTATATA GTATGTCCGC TGTCCGTAAA CTTCCTCACA
AATTTTACGG TAAGCCCTGA TATACAGAAT ACCTTTACCA TCACTTCTTA TCTTTCTTCT
GTAGCGACAT TAACCATTGG CTCCGGGATC ATCTTTCAGC TGCCCGTAGT GATTTATATT
CTGTCAAAGT TCGGTATCAT GACCCCTAAG TTCATGCGTT CAACCAGAAG ATATGCCGCT
GTAATTATCC TCATTGTTGC TGCAGTTGTA ACACCTACAG CAGACGTGAT GACAATGCTT
GTAGTAGCCT TCCCACTATT TGTACTATAC GAACTGAGTA TCTTTATATC AGCTAATATT
GAGCGCAAAA GAAATAAGGA GCTTTATGGG GTAGCCAAAG TAAAGAAATC GTAA
 
Protein sequence
MSDNKKRDLI GAIKEKGKTL EAEMSFFDHI DVLRKHLLRS LLVVVLFTIG AFWFSDFIFN 
DLIMGPKNPD FWTYRMMCKM AAAWPNLIGS DFCITHIDAK IINTEMAGQF TLQINACVMV
GIILGIPYIL FELWLFIKPA LHDNERKSAS HFVMFASTLF FIGILFGYYI VCPLSVNFLT
NFTVSPDIQN TFTITSYLSS VATLTIGSGI IFQLPVVIYI LSKFGIMTPK FMRSTRRYAA
VIILIVAAVV TPTADVMTML VVAFPLFVLY ELSIFISANI ERKRNKELYG VAKVKKS
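
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a simple translation. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers introduced here, not part of any database tooling. The codon table is built from the standard genetic code, on the understanding that translation table 11 uses the same codon-to-amino-acid assignments and differs in its permitted start codons. The sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Standard codon table in the usual TCAG ordering (first base varies slowest).
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove all whitespace (block separators, newlines) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean_sequence(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three ten-base blocks of the gene sequence above.
first_blocks = "ATGAGTGATA ATAAAAAACG TGACCTGATT"
print(translate(first_blocks))  # prints: MSDNKKRDLI
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.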