Gene TM1040_3428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3428 
Symbol 
ID4075602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp450816 
End bp452498 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content61% 
IMG OID638004937 
Producttetratricopeptide TPR_2 
Protein accessionYP_611662 
Protein GI99078404 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0384278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTG CGTTTTCTCC CCGCGATCTG ACCCCGCTCT ATGCTTCGGC GGTTGCGCTG 
CTCAATGCGG GCAATCTTGC AGCAGCGAGG GAACAGCTTT TGCGGTCTTT GCATGAAGAA
GGGGAGACGG CGCTCAGCTA CCACTATCTC GCGCAGATCC TCGAGCGCGC CGGTGGGCCC
GGCACCGAGG TGCTGGCGGC GCAGGCCAAT GCGTTGAAAT GTGCGCCCAA CCACCCGGTG
TTTCTGGCAG CACTCGGTGT GCGCTTGGTT GATGCCGGGC GCGATCAAGA TGCGCTTGTG
CTCCTCAGGA AGGCGCTGGA GATCGACCCC CAAAACCCTC TGGCGCTCCC CTGGGTCATG
CGGCTCAATC GCCGCTACTT GCAGTGGACC TCATATGAGG AAGAGGCCGC AGCATGTGAG
ATGCTCCTCA AGAGCGCGCA CAAGGTGGAT CCGCTGCTCT CGCTGACATT TGTGGATGAT
CCAGCCCAGC AGTTGAAGAT CGCCGCACGC AGCGCGCCCG ATGTCGCCGT TCAGCCGCTG
GAGCCGCATG CCCCGCATGA AAAGATCCGT GTCGGGTATT TCTCGGCAGA TTTCAGCGAA
CATGCCACGA TGCATCTGAT CGAAGGGCTG ATTGAGGCGC ACGACAAAGG CGTCTTTGAA
TTCCATGTGT ATGATTTCAA ACCGGATCCC ACGTCTCGCC AGCATCAGAT CATCCGTGAT
TTTGCAGATT GCTATCACGA CGTTAGTGCG CTTTCGGCCG CAGAAACGGC CGCGTTGGCC
CGCCAGGAGC AGCTTGATAT TGCCGTTGAT CTCAAGGGGA TCACCACCGA CTCGCGACCA
ATGATCTTTG CGCTCCGGGT CGCGCCTGTG CAGGTTTCAT TTCTGGGATT TCCAGGCACA
ACCGCGATTG CGGAAATGGA CTATATGATC GCGGATCGCA TCACCATTCC TGACGGGGAT
GAACGGTTTT ACAGCGAAAA GATTCTCCGC CTGCCGGGAT GCTACCAGCC CAACACGAAT
TCGCGCATCC TGCCTGAGGG GGCAGGTGGG CGCGCCGCCT TTGGCCTGCC GGACGACCGG
TTTGTATTCG CGAGCTTCAA TCATCCGCAC AAGGTTGGCC CGAGCGAATT TGCAACTTGG
ATGGAGATTT TGCGCGAGGT GCCGCAGGCG GTGCTCCTGT TCTATTCCGG AAAGGCGGAC
CTGGGCGCCG CGCTGGCCGA ACGAGCCCAG GCCCACGGGG TTGAGCCGTC GCGGGTCTTG
GCCTGCGGCC CGCTTCCCCA GACGGCGCAT CTGGAGCGGA TCGCGCAGGT CGATCTTTGT
CTGGATTGTT TTGCCTACAA TGCCCATACC ACCGCTTCGG ATGCGGTCTG GGCGGGGGTG
CCCTTGCTGA CGCTTGCAGG TCGTCAGTTT GCGGCACGCG TTGCGACCAG CATCCTGTCT
ACGGCGGGTG TGCCGGAACT CTCGACGGTC TCGACCAAGG ACTATGTGGC AAAGGCGGTG
CATCTTGCCA CCCACCCAGA GGATCTTCTG GCCCTAAGGC AGAAAATTTC AGCGGCGCGG
CACGGCTCGC CTCTCTTTGA TACCAAGCGC TGGACCCGTG ACTACGAGGC TCTCTTGCAG
ATGTGCTATC AACGCCACCG GGCCGGTGAG GCACCCGACC ACATGTCCCT CAGCGACGGC
TGA
 
Protein sequence
MTIAFSPRDL TPLYASAVAL LNAGNLAAAR EQLLRSLHEE GETALSYHYL AQILERAGGP 
GTEVLAAQAN ALKCAPNHPV FLAALGVRLV DAGRDQDALV LLRKALEIDP QNPLALPWVM
RLNRRYLQWT SYEEEAAACE MLLKSAHKVD PLLSLTFVDD PAQQLKIAAR SAPDVAVQPL
EPHAPHEKIR VGYFSADFSE HATMHLIEGL IEAHDKGVFE FHVYDFKPDP TSRQHQIIRD
FADCYHDVSA LSAAETAALA RQEQLDIAVD LKGITTDSRP MIFALRVAPV QVSFLGFPGT
TAIAEMDYMI ADRITIPDGD ERFYSEKILR LPGCYQPNTN SRILPEGAGG RAAFGLPDDR
FVFASFNHPH KVGPSEFATW MEILREVPQA VLLFYSGKAD LGAALAERAQ AHGVEPSRVL
ACGPLPQTAH LERIAQVDLC LDCFAYNAHT TASDAVWAGV PLLTLAGRQF AARVATSILS
TAGVPELSTV STKDYVAKAV HLATHPEDLL ALRQKISAAR HGSPLFDTKR WTRDYEALLQ
MCYQRHRAGE APDHMSLSDG