Gene TM1040_2346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2346 
Symbol 
ID4076465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2466653 
End bp2468146 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content59% 
IMG OID638007668 
Producttype II and III secretion system protein 
Protein accessionYP_614340 
Protein GI99082186 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4964] Flp pilus assembly protein, secretin CpaC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.397877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACG AAGTCGCGCA CATTAGCCGC AAAATCGGGG CGCAAAGACC CCACATCAGA 
GGCGTGATCG GAAAGGCAGG TCACATGTCA ATTAGACGGT ATTGTGCGGC GGCCCTCACG
GGGCTGGCCC TCCTATTTGC TCATTCTCCC TCGCCCGCAG TGGCGCAGGG GATCAGCGTA
TTGAAAAAGG GCACCAACGC GGTTCTCGAT GTGCCGATGA ACCGGGCCGT GGTGGTCGAA
GCCGATGTGC CCTTTGCGGA ACTCAGCATC GCCAATCCCA GCATTGCGGA TATTTCCTCA
CTGTCGGATC GCACGATCTA TGTGCTGGGC AAATCGCCCG GCCTGACGAC GCTGACGCTG
CTGGACGGCT CTGGCGGGCT GATCACCAAC GTGGATGTGC GCGTCGCTGC GGATGTGAGC
GAGTTCAAAC AGCGCCTGCA GCAGATCCTG CCCAATGAAA AGATCGAAGT GCGCACCGCC
AATGATGGCA TTGTTCTCTC TGGCACCGTT TCCAGTGCCC AACGCCTGCA GCGCGCGCTG
GATCTGGCAG AACGCTATGC TCCGGATCGC GTCAGCAACC TGATGACCGT GGGCGGTATT
CAGCAAGTGA TGCTGAAGGT CCGCTTTGCC GAAATGGAGC GCTCCGTCAG TAAGTCGCTG
GGTGGCTCCA TGCTGATCAG ATCTTCGGAT GGCGCGATTG CCACCGGCTC CTTCCAGAAC
CGTGGCGGCA ACGGAAATAT CTTTGGCGAC AGTGTCACAA GCCCCGTTCA GCTCCAGAGC
GAGACCCTTG GCGCAGCCCT GTTCGGTTTC GACATTGGCG CGGTGCAGTT CAACGTGCTC
CTGGAAGCAC TCGAGCAAAA AGGTCTGGTA CGCACCCTGG CGGAGCCGAA CCTCTCGGCC
CTTTCGGGTC AAGAAGCCAA TTTCCTCGCC GGTGGTGAAT ATCCGGTCCC CGTTGCGCAA
GAAGATGGCG TGATCACGGT TGAGTTCAAA CCCTTCGGGA TTGAGCTGAA CTTCATCCCG
CGCGTGGTTG ATGGCGATCT GATCAATCTG GAGCTCAAGG CTGCAGTCTC GGCGATCGAC
ACCACCAATT CCGCGACCTT TGACGGGTTC TCCATCAACG CCTTCTCGCG CCGCGAAACC
GCAACGACGG TAGAGCTGCG CGATGGTGAG AGCTTTGCGA TTGCCGGTCT CATTGAGGAC
GAGTTCCGCG ACGGGGCCGC ACAGGTGCCT TGGCTTGGCG ATGTGCCGGT TCTGGGCGCC
CTCTTCCGCA GTGCAGATTA CGCCCGCAAC CAGAGCGAAT TGGTTATCAT CGTCTCGGCT
CATCTGGTGA CCCCGACCCG CGGCGAAGCG CTGGCGCTGC CAACAGATCG CATTCGCCCC
CCGAGCGAGA AGAATCTGTT CTTGTTCGGA CAGACCGAAC GCGCACAGGC CGGGGCTGCA
GGCGAAGTTG CAAATCAGGA CTTCAACGGC TCTTATGGCT ATCTACTTGA TTGA
 
Protein sequence
MKHEVAHISR KIGAQRPHIR GVIGKAGHMS IRRYCAAALT GLALLFAHSP SPAVAQGISV 
LKKGTNAVLD VPMNRAVVVE ADVPFAELSI ANPSIADISS LSDRTIYVLG KSPGLTTLTL
LDGSGGLITN VDVRVAADVS EFKQRLQQIL PNEKIEVRTA NDGIVLSGTV SSAQRLQRAL
DLAERYAPDR VSNLMTVGGI QQVMLKVRFA EMERSVSKSL GGSMLIRSSD GAIATGSFQN
RGGNGNIFGD SVTSPVQLQS ETLGAALFGF DIGAVQFNVL LEALEQKGLV RTLAEPNLSA
LSGQEANFLA GGEYPVPVAQ EDGVITVEFK PFGIELNFIP RVVDGDLINL ELKAAVSAID
TTNSATFDGF SINAFSRRET ATTVELRDGE SFAIAGLIED EFRDGAAQVP WLGDVPVLGA
LFRSADYARN QSELVIIVSA HLVTPTRGEA LALPTDRIRP PSEKNLFLFG QTERAQAGAA
GEVANQDFNG SYGYLLD