Gene TM1040_1795 details

Gene Information

Locus tag: TM1040_1795
Symbol:
ID: 4076824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1888716
End bp: 1889606
Gene Length: 891 bp
Protein Length: 296 aa
Translation table: 11
GC content: 59%
IMG OID: 638007110
Product: Sec-independent protein translocase TatC
Protein accession: YP_613790
Protein GI: 99081636
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0805] Sec-independent protein secretion pathway component TatC
TIGRFAM ID: [TIGR00945] Twin arginine targeting (Tat) protein translocase TatC
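
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 891 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1888716, 1889606)
print(length)                     # 891
print(protein_length_aa(length))  # 296
```

Both results match the Gene Length and Protein Length fields of this record.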


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0386851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.066575
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAA CAGACGATCT TGACGATTCC ACAGCGCCGC TGATCGAGCA TCTGGCCGAA 
TTGCGCAGCC GCCTGATCCG GGCGGTCATG GCCTTTGCGG TGGGGATCGT GCTGGCCTTT
ATGGTAGCCG AACCGATCCT GCAGTTCCTC GTCGCTCCGA TCGAGCAGAC CCTACGAGAA
TTGGGCGATC CCTCCCCGAC GCTGCAGTAT ACCTCGCCGC AGGAGTATCT CTTTACCCTT
TTCCGGATCT CGATGGTGTT TGGATTTGCG CTGTCGTTCC CGGTGATCGG CTTTCAGCTC
TGGCGGTTTG TGGCGCCGGG CCTCTACAAG AGCGAGAAAG GCGCGTTTCT GCCGTTCCTG
ATTGCCTCGC CCTTCATGTT CCTGCTTGGC GCGTCCTTTG CGCAATTTGT GGTGACGCCA
CTGGCAATGC AGTTCTTCCT CGGCTTTGCT GACGTGAGCT CGATCTTTGC GGGCCTGTTG
TCGCAAGCCA CCGGAGGCGA CGTGCCTGCG GATGTTGCCG TGGTGCCGGA GACATCCGAA
GGGGTGAAGA TCACCTTCTT TGGCAAAGTG AACGAGAGCC TTGATATTAC GCTCAAATTC
ATCATGGCCT TTGGTCTGTG CTTCCAGCTG CCGGTTCTTC TCACCCTGAT GGGCAAGGCC
GGATTGGTGA GCGCCGAAGG GCTGGGTGGC ATGCGCAAAT ATGCGGTTGT GGCCATTCTG
GTGCTGGCCG CGTTGGTGAC GCCGCCGGAT GTGATCACCC AGATCATTCT CTTTACGGTG
GTCTACGGGC TTTATGAGGT ATCGATCTTC CTCGTCGGGC GCGTCGAGAA AAAGCGCGAG
GCGCAGCTGC GCGCCGAAGG CTACTATGAC GACGAGCTGG ACGGCGAATA A
 
Protein sequence
MSQTDDLDDS TAPLIEHLAE LRSRLIRAVM AFAVGIVLAF MVAEPILQFL VAPIEQTLRE 
LGDPSPTLQY TSPQEYLFTL FRISMVFGFA LSFPVIGFQL WRFVAPGLYK SEKGAFLPFL
IASPFMFLLG ASFAQFVVTP LAMQFFLGFA DVSSIFAGLL SQATGGDVPA DVAVVPETSE
GVKITFFGKV NESLDITLKF IMAFGLCFQL PVLLTLMGKA GLVSAEGLGG MRKYAVVAIL
VLAALVTPPD VITQIILFTV VYGLYEVSIF LVGRVEKKRE AQLRAEGYYD DELDGE
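
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: it uses the amino-acid assignments of NCBI translation table 11 (which match the standard code for elongation), ignores table 11's alternative start codons (not needed here, since the gene begins with ATG), and strips the 10-base block spacing used above. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCCAAA CAGACGATCT TGACGATTCC"))  # MSQTDDLDDS
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field in the Gene Information section.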