Gene TM1040_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1104 
Symbol 
ID4077811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1185278 
End bp1187128 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content57% 
IMG OID638006408 
ProductTPR repeat-containing protein 
Protein accessionYP_613099 
Protein GI99080945 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCAGC CCAATCAAAC CCCAAAACTC GCGGTGATGC AGCCCACCAA GAAGCAGTTC 
CCCGAGTTCG AAAAATTCGC CAATTCCGTC AAGGCGCAGG AACGCGCGAA ATTCGCCTAT
GAAATGAAAA ACGGGCTGCG CAGCAGCCGG GCGAAAGGGG AGCAGACCAA GGACACTGTG
ACCAAGCTTC TGCGCAAGGC TCAGGCGCTC ATCCTGAAAC ATGAGAACGA AAAGGCCGAG
GCGGTGATCA ACCAGGCGAT TGCGCTCGAG GGACAGGATG CGCGGCTCCA TTCGCTGATG
GGCGAAGTGC TCATGAAGGA TTCCAACCGG ATGATGGACG CGCTGGGGAG CCTGATGCGA
GCCGTCAAGC TGGAGCCGAC CAACGGCAGC CACTACGGCA TGATCGGCAC CCTTCTGATG
CGCCTTCAGA AATTTGAAGA GGCGATCGAT TACTTTGAAA TTGCTGTGAA ATTCGATCCC
AAAAATCATA TCGCCCTGTC GCGGATGATG CATACCAAAG CCCATCGCGC GCGCTGGGAT
GATTTCAACA AGATCCCGAC CTACCTCAAG CAGTTCAAGA ACCAGAACGT GCTGTCCGAT
CCTTTTGCCT TCCTGTCGCT TTGCGATGAT GCGGCGTTTC AGAAACAGCG CTCCATCGCC
CAGATCAATT CCAAGTTCTG CAACCCCGTC AAAGCCCCCA TCTTCAAGGG GGAGCGCGCG
GCCGGCGAAA AGATCCGGAT CGGCTATTTC TCGAACGACT TCTACAACCA CGCCACCATG
CATCTCATGG GGGGGCTGCT GGAAAACCAC GACCGATCGA AGTTCGAGAT CTATATCTAC
GACTATGGCT CCAAGCTGCG GGACCACGAA CACGAGCGCG CGCGTCGCAG CGCCGATGTG
TTTCGCGATA TCCGCACTCT GAACACTGCG CAGATCGTTG ATCTGGCGCA TGGGGATGCT
CTGGATATTG CGGTGGATCT CAAGGGCTTT ACCGAGAATG GGCGGCTGGA CATGTTCAAC
AGTCGCGTGG CGCCTGTGCA GGTGGCCTAT CTGGGGTATC CGGGCACGAC CGGTTTGAAA
TCAATGGATT ACATGGTGGC CGACAAGATC ACGATCCCGT CGCACCTGCG CAAGCACTAT
ACCGAAAACA TCCTCTATAT GCCCAATTGC TACCAGCCCA ATGACGAGTC GCGCTTTATC
GCCGAGGTTG CAGACACGCG GGCCAGCCAT GATCTCCCCG AAGAGGGGTT TGTGTTCTCC
TCCTTCAACA ACCCCTACAA GGTCACGCCG CGCGAGTTCG GCATCTGGAT GGACCTCCTG
AAAGAAGTGC CCGACAGCGT GCTGTGGTTC TATGTCTCAA AGGCAGAGAT CATCGACCGT
CTGCGCAAGG AAGCCGAGTC GCGCGGTGTC GATGGTGCGC GCATCATCCC CACCGGGCGG
ATGCAGCCGG AGTACCACCT GGCGCGCCTG AAACACGCCG ATCTTTTCCT GGATACCTTC
AACGTCAACG CACACACGAC CGCCAGTGAT GCACTCTGGG CGGGCTTGCC CGTTGTCACC
AAAACCGGCG AGCAATTCGC CGCGCGGGTG GCGGGCAGCA TCCTCAGCGC AGCGGGGCTT
GAGGATCTGG TCACTCATAG CGAGAAGAAA TATTACGAAG TGGCTCTGCG CATTGCCCAA
GATCCTGACT ATCTTGCGGA TATTCGCAAG CGTCTGGCGG CGTCGCACGA GAACTCGCCG
CTGTTTGATA CCAAATCCTA CACCCGCGAT TTTGAACGTC TGATGGAGCG CGCGTTCCAG
AATTACATCG ACGGCAACGC CCCGCGCAGT CTCGGAATTT CCGCAGCCTG A
 
Protein sequence
MSQPNQTPKL AVMQPTKKQF PEFEKFANSV KAQERAKFAY EMKNGLRSSR AKGEQTKDTV 
TKLLRKAQAL ILKHENEKAE AVINQAIALE GQDARLHSLM GEVLMKDSNR MMDALGSLMR
AVKLEPTNGS HYGMIGTLLM RLQKFEEAID YFEIAVKFDP KNHIALSRMM HTKAHRARWD
DFNKIPTYLK QFKNQNVLSD PFAFLSLCDD AAFQKQRSIA QINSKFCNPV KAPIFKGERA
AGEKIRIGYF SNDFYNHATM HLMGGLLENH DRSKFEIYIY DYGSKLRDHE HERARRSADV
FRDIRTLNTA QIVDLAHGDA LDIAVDLKGF TENGRLDMFN SRVAPVQVAY LGYPGTTGLK
SMDYMVADKI TIPSHLRKHY TENILYMPNC YQPNDESRFI AEVADTRASH DLPEEGFVFS
SFNNPYKVTP REFGIWMDLL KEVPDSVLWF YVSKAEIIDR LRKEAESRGV DGARIIPTGR
MQPEYHLARL KHADLFLDTF NVNAHTTASD ALWAGLPVVT KTGEQFAARV AGSILSAAGL
EDLVTHSEKK YYEVALRIAQ DPDYLADIRK RLAASHENSP LFDTKSYTRD FERLMERAFQ
NYIDGNAPRS LGISAA